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Description 

BACKGROUND OF THE INVENTION 

1. Field of the invention 

[0001 ] The present invention relates to an optical disc 
tor recording still picture data and audio data to be pro- 
duced simultaneously with the still picture, an apparatus 
and a method for recording such an optical disc, and an 
apparatus and a method for reproducing such an optical 
disc. 

2. Description of the prior art 
Digital cameras 

[0002] Digital cameras for capturing still pictures us- 
ing the JPEG compression format, formally known as 
the ISO/I EC 10918-1 standard, have become widely 
available in the last few years. One reason for the grow- 
ing popularity of digital cameras is the improved audio- 
visual (AV) processing capability of modern personal 
computers (PC). 

[0003] Images captured by a digital camera can be 
transferred by various means, including semiconductor 
memory, floppy disk, and infrared communications, to a 
PC in a format enabling the image data to be processed 
and manipulated on the PC. The captured image data 
can then be edited on the PC for use by presentation 
programs, word processors, and by Internet content 
providers. 

[0004] Digital cameras enabling audio to be captured 
with still pictures have been more recently introduced. 
This ability to capture sound with still pictures has 
helped to further differentiate the digital camera from 
conventional film-based still cameras. 
[0005] Fig. 7 shows the relationship between still pic- 
ture data (JPEG data) and audio data recorded by such 
a digital camera. As shown in Fig. 7, the still picture data 
(JPEG data) and audio data are stored in separate files. 
Each time a picture is taken (recorded), separate JPEG 
data and audio data files are created. 
[0006] There are two basic methods for managing the 
relationship between still picture data (JPEG data) and 
audio data files. The first, as shown in Fig. 7(a), uses a 
link manager to maintain the relationship (link) between 
a JPEG data file and the associated audio data file. The 
other, as shown in Fig. 7(b), assigns the same root file 
name (the part of the file name minus the extension, e. 
g., "xyz" in Fig. 7(b)) to both the JPEG data file and the 
audio data file. 

[0007] Using either of the above-described methods, 
an audio data file can be linked to a particular still picture 
data file when the picture is captured, and can be 
changed during any later editing process. That is, if the 
user decides after taking a picture that the audio asso- 
ciated with that picture is inappropriate or undesirable, 



different audio data can be selected and linked to the 
image data on the PC. 

[0008] The advent of MPEG (Moving Picture Experts 
Group) standards for handling audio-video data contain- 

5 ing moving and still pictures together with audio has also 
accelerated the development of multimedia products 
and services based on MPEG standards. 
[0009] When image data and audio are recorded us- 
ing the MPEG standard, the audio stream and video 

10 stream are multiplexed and recorded as a single system 
stream as shown in Fig. 6(c). This makes it very difficult 
to freely change the audio stream associated with a par- 
ticular video stream after the initial recording. More spe- 
cifically, to change the audio data recorded for a partic- 

15 ular still picture, the still picture data and audio data must 
be edited together as a single MPEG system stream. 
This means that the MPEG system stream must first be 
decoded, and the extracted still picture data and audio 
data must then be re-encoded as a single system 

20 stream after editing. Editing the still picture data and au- 
dio data after recording is therefore much more difficult 
than it is with a conventional digital camera. 
[0010] The US patent 5,481 ,543 is concerned with a 
technical teaching for decoding MPEG data. The meth- 

2S od is directed towards the generation of a bit stream by 
multiplexing non-compressed auxiliary information with 
an information stream. The information stream is ob- 
tained by compressing fixed-size units of an information 
signal with a variable compression ratio to provide var- 

30 ying-sized units of the information stream. The auxiliary 
information is for use in subsequently processing the in- 
formation stream. The above described problem is also 
inherent in this known technique. 
[001 1] The European patent application EP 0 755 1 62 

35 is directed towards a still picture camera and disc for 
recording a still picture with voice data. According to this 
citation the picture data are to be stored in the JPEG 
format. The JPEG format provide no means for easily 
changing audio data which have been synchronized or 

40 associated with a particular video data once the record- 
ing has been done. 

SUMMARY OF THE INVENTION 

45 [0012] An object of the present invention is therefore 
to provide a recording medium, an apparatus, and a 
method whereby audio data presented with a still picture 
can be freely and easily changed even after recording 
the still picture data using an MPEG standard format. 

50 [0013] To achieve the above object, a recording me- 
dium according to the present invention that is repro- 
ducible by reproducing apparatus having a decoder 
buffer, decoder and output section, has recorded there- 
on a video part stream (ST1) comprising a plurality of 

55 units containing still picture data for one picture, and an 
audio part stream (ST2) comprising one or a plurality of 
units containing audio data to be reproduced with the 
still picture data; wherein said units store time stamp in- 
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formation indicative of a time required for a decoding 
process and the time at which the data is to be output, 
said time stamp information includes a time SCR2 in- 
dicative of a time at which the last unit in the video part 
stream is to be input to a decoder buffer, and a time 
SCR3 indicative of a time at which the first unit in the 
audio part stream is to be input to a decoder buffer, and 
said times SCR2 and SCR3 are defined to satisfy the 
following relation: SCR2 + Tp < SCR3, where Tp is the 
time required from the start to the end of inputting one 
unit of the video part stream (ST1) to a decoder buffer, 
wherein each unit recorded on said disc indicates a con- 
tiguous series of data on a track. 
[001 4] By the above arrangement, the second system 
stream carrying the audio data is stored in the optical 
disc independently of the first system stream. Thus, the 
data in the second system stream can be easily revised. 
[0015] Preferably, the time stamp information further 
includes a time SCR1 at which the first unit in the first 
system stream is input to a decoder buffer. In this case, 
times SCR1 and SCR2 are defined as: 

SCR1 =0 



SCR2 + Tp < 27000000 (27 MHz) 

where (27 MHz) indicates that the numeric value shown 
therebetore is a count of a 27 MHz clock. 
[0016] By this arrangement, the time period for trans- 
ferring the first system stream completely to the decoder 
buffer can be set to 1 second or less. 
[0017] Yet further preferably in this case, time SCR3 
is defined as SCR3 = 27000000 (27 MHz). 
[0018] By this arrangement, the transfer start time of 
the second system stream to the decoder buffer can be 
set to 1 second after the start transfer time of the first 
system stream to the decoder buffer. 
[0019] Yet further preferably, the time stamp informa- 
tion also includes a time PTS1 at which the first system 
stream is presented from the output section, and a time 
PTS3 at which the second system stream is output from 
the decoder. In this case, times PTS1 and PTS3are the 
same. 

[0020] By this arrangement, the still picture produced 
by the first system stream and the sound produced by 
the second system stream can be effected simultane- 
ously. 

[0021] Yet further preferably, the time stamp informa- 
tion also includes a decoding start time DTS1 at which 
a decoder starts decoding the first system stream. This 
time DTS1 is defined as: 

DTS1 = 90000 (90 kHz) 
where (90 kHz) indicates that the numeric value shown 



therebefore is a count of a 90 kHz clock. 
[0022] By this arrangement, the decode start time of 
the second system stream can be set to 1 second after 
the start transfer of the first system stream to the decod- 
5 er buffer. 

[0023] In this case, times PTS1 and PTS3 are prefer- 
ably defined by the equation: 

1Q PTS1 = PTS3 = 90000 (90 kHz) + Tv 

where (90 kHz) indicates that the numeric value shown 
therebefore is a count of a 90 kHz clock, and Tv is the 
video data frame period. 

is [0024] By this arrangement, the presentation of the 
still picture and the sound can be done after 1 second 
plus 1 frame period Tv from the start transfer of the first 
system stream to the decoder buffer. 
[0025] First and second system stream management 

20 information (Volume information) is further preferably 
recorded to an optical disc according to the present in- 
vention, and the management information for the first 
system stream includes an identification flag 
(Audio_Flag) for declaring there is audio data to be re- 

25 produced synchronized with the still picture data. 

[0026] By this identification flag, it is possible to detect 
whether or not the sound accompanies the still picture. 
[0027] The object of the present invention is also 
solved by an optical disc recording apparatus for record- 

30 jng a system stream containing still picture data and au- 
dio data to be reproduced with the still picture data to 
an optical disc that is reproducible by a reproducing ap- 
paratus having a decoder buffer, decoder, and output 
section, said optical disc recording apparatus compris- 
es jng: an encoder, and a system controller; said encoder 
generating a video part stream (ST1 ) comprising a plu- 
rality of units containing still picture data for one picture, 
and an audio part stream (ST2) comprising one or a plu- 
rality of units containing audio data to be reproduced 

40 with the still picture data; said encoder storing in said 
units time stamp information indicative of a time required 
for a decoding process and the time at which the data 
is to be output; wherein the time stamp information in- 
cludes a time SCR2 indicative of a time at which the last 

45 unit in the video part stream is to be input to a decoder 
buffer, and a time SCR3 indicative of a time at which the 
first unit in the audio part stream is to be input to a de- 
coder buffer, and said times SCR2 and SCR3 are de- 
fined to satisfy the following relation: SCR2 + Tp < 

so SCR3, where Tp is the time required from the start to 
the end of inputting one unit to a decoder buffer, wherein 
each unit recorded on said disc indicates a contiguous 
series of data on a track. 

[0028] By the above arrangement, the second system 
55 stream carrying the audio data is stored in the optical 
disc independently of the first system stream. Thus, the 
data in the second system stream can be easily revised. 
[0029] The system controller of this optical disc re- 
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cording apparatus further preferably stores as time 
stamp information a time SCR1 at which the first unit in 
the first system stream is input to a decoder buffer, and 
a time PTS1 at which the first system stream is output 
from the output section. These times SCR1 , SCR2, and 
PTS1 are defined as: 

SCR1 = 0 



SCR2 < 27000000 (27 MHz) - Tp 



PTS1 = 90000 (90 kHz) + Tv 

where (27 MHz) indicates that the numeric value shown 
therebefore is a count of a 27 MHz clock, (90 kHz) indi- 
cates that the numeric value shown therebefore is a 
count of a 90 kHz clock, Tp is the time required to trans- 
fer the last unit of the first system stream, and Tv is the 
video data frame period. 

[0030] By this arrangement, the time for start trans- 
ferring the first system stream to the decoder buffer is 
set to 0, the time for finish transferring the first system 
stream to the decoder buffer is set to 1 second or less, 
and the time for displaying or presenting the still picture 
is set to 1 second plus 1 frame period Tv from the start 
transfer of the first system stream to the decoder buffer. 
[0031] Further preferably, the system controller fur- 
ther stores as time stamp information a time PTS3 at 
which the second system stream is output from the de- 
coder. In this case, times SCR3 and PTS3 are defined 
as: 

SCR3 = 27000000 (27 MHz) 



PTS3 = 90000 (90 kHz) + Tv. 

[0032] By this arrangement, the time for transferring 
the second system stream to the decoder buffer can be 
set to 1 second from the strati transfer of the first system 
stream, and the time for decoding and reproducing the 
sound can be set to 1 second plus 1 frame period Tv. 
[0033] The system controller further preferably gen- 
erates first and second system stream management in- 
formation, and stores in the management information for 
the first system stream an identification flag 
(Audio^Flag). This flag is used for declaring whether 
there is audio data to be reproduced synchronized with 
the still picture data. 

[0034] By this identification flag, it is possible to detect 
whether or not the sound accompanies the still picture. 
[0035] The system controller yet further preferably 
records audio data reproduction time 
(CelLPIayback_Time) in the management information 



for the second system stream. 

[0036] By this arrangement, it is possible to set the 

sound reproducing time. 

[0037] The present invention also provides an optical 
5 disc recording method for recording a system stream 
containing still picture data and a separate system 
stream containing audio data to be reproduced with the 
still picture data to an optical disc according to the 
present invention as defined in claim 1 3. 

10 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0038] The present invention will become more fully 
understood from the detailed description given below 
*5 and the accompanying diagrams wherein: 

Fig. 1 is a block diagram of a DVD recording appa- 
ratus drive; 

Figs. 2(a) and 2(b) show the relationship between 
20 address space on a disc and the amount of data 
stored in the track buffer; 

Figs. 3(a) and 3(b) show the correlation between I, 
B, and P pictures in an MPEG video stream; 
Fig. 4 shows the structure of an MPEG system 
25 stream; 

Fig. 5 is a block diagram of an MPEG system stream 
decoder (P_STD); 

Figs. 6(a), 6(b), 6(c) and 6(d) show video data, the 
change in the amount of data stored to the video 
30 buffer, a typical MPEG system stream, and an audio 
data signal, respectively, according to prior art; 
Figs. 7(a) and 7(b) illustrate links between still pic- 
tures and audio in a digital still camera, according 
to prior art; 

35 -Figs. 8(a) and 8(b) are diagrams showing two dif- 
ferent styles of a directory structure and the physical 
arrangement of the disc recording surface; 
Figs. 9(a) and 9(b) show the structure of a manage- 
ment information file, and the data stream; 

40 Figs. 1 0(a), 1 0(b) and 1 0(c) show the management 
information data for still picture data and audio data, 
a data stream for the still picture data and audio- 
data, and another data stream for the still picture 
data and audio data; 

45 Figs. 11(a), 11(b) and 11(c) are diagrams showing 
a still picture data VOB, an audio data VOB, and a 
combined VOB, according to the present invention; 
Fig. 12 is a block diagram of a DVD recording ap- 
paratus; 

50 Fig. 1 3 is a flow chart of a recording process of the 
DVD recording apparatus shown in Fig. 12; 
Fig. 14 is a flow chart of the still picture data VOB 
generating process shown as step S1 301 in Fig. 1 3 
in the DVD recording apparatus shown in Fig. 12; 

55 Fig. 15 is a flow chart of the audio data VOB gen- 
erating process shown as step S1303 in Fig. 13 in 
the DVD recording apparatus shown in Fig. 12; 
Fig. 16 is a flow chart of the management informa- 



4 



7 



EP 0 949 825 B1 



8 



tion file generating process shown as step S1304 
in Fig. 1 3 in the DVD recording apparatus shown in 
Fig. 12; 

Figs. 17(a) and 17(b) are explanatory views show- 
ing two still pictures; 

Figs. 18(a), 18(b), 18(c), 18(d) and 18(e) are dia- 
grams showing an operation according to the prior 
art to reproduce a still picture with an audio data; 
Figs. 19(a), 19(b), 19(c), 19(d) and 19(e) are dia- 
grams showing an operation according to the 
present invention to reproduce a still picture with 
single audio data; and 

Figs. 20(a), 20(b), 20(c), 20(d) and 20(e) are dia- 
grams showing an operation according to the 
present invention to reproduce a still picture with du- 
al audio data. 

DESCRIPTION OF PREFERRED EMBODIMENTS 

[0039] The preferred embodiments of the present in- 
vention are described below with reference to the ac- 
companying figures. 

[0040] A preferred embodiment of the present- inven- 
tion is described below with reference to a DVD record- 
ing apparatus using DVD-RAM as the MPEG stream re- 
cording medium. 

1. Overview of a normal MPEG stream 

[0041] A normal MPEG stream of audio-video data is 
described first below. The structure of the MPEG stream 
will be known to those with ordinary knowledge of the 
related art, and the following description therefore fo- 
cuses on those parts having a particular relationship to 
the present invention. 

[0042] As previously noted above, the MPEG stand- 
ard defines an audio-video compression method that 
has been formalized as the ISO/lEC 1 381 8 international 
standard. 

[0043] The MPEG standard achieves high efficiency 
data compression primarily by means of the following 
two features. 

[0044] First, moving picture data is compressed using 
a combination of conventional intraframe compression 
using a spatial frequency characteristic to remove intra- 
frame redundancy, and interframe compression using 
temporal correlations between frames to remove redun- 
dancy in adjacent frames. Even more specifically the 
MPEG standard compresses moving picture data by 
first categorizing each frame (also referred to as a pic- 
ture in MPEG parlance) as an I picture (intra-coded 
frame), P picture (a predictive-coded frame that is coded 
with reference to a preceding picture), or a B picture (a 
bidirectionally predictive-coded frame that is coded with 
reference to both a preceding and a following picture). 
[0045] The relationship between I, P, and B pictures 
is shown in Fig. 3. As will be known from Fig. 3, P pic- 
tures are coded with reference to the closest preceding 



I or P picture, and B pictures are coded with reference 
to the closest preceding and following I or P pictures. As 
also shown in Fig. 3, the picture display order and the 
coding order of the compressed data are different be- 

5 cause each B picture is also dependent on an I or P pic- 
ture that is presented after the B picture. 
[0046] The second feature of MPEG compression is 
dynamic (coding) data allocation by picture unit based 
on image complexity. An MPEG decoder has an input 

10 buffer for storing the input data stream, thus enabling a 
large (coding) data size (that is, more data) to be allo- 
cated to complicated images that are more difficult to 
compress. 

[0047] MPEG also supports MPEG audio, a separate 

is MPEG encoding standard for audio data to be repro- 
duced with moving picture data. In addition, however, 
MPEG also supports the use of various other types of 
audio encoding for specific applications. 
[0048] The present invention allows for two types of 

20 audio data encoding, that is, encoding with data com- 
pression and encoding without data compression. Ex- 
emplary audio encoding methods with data compres- 
sion include MPEG audio and Dolby(R) Digital (AC-3); 
linear pulse code modulation (LPCM) is typical of audio 

25 encoding without data compression. Both AC-3 and 
LPCM are fixed bit rate coding methods. MPEG audio 
can select from among several different bit rates on an 
audio frame unit basis, although the range of bit rates is 
not as great as that available for video stream coding. 

30 [0049] The MPEG system then multiplexes the en- 
coded moving picture data and audio data into a single 
stream, which is referred to as the MPEG system 
stream. This multiplexed moving picture data and audio 
data is commonly referred to as AV data. 

35 [0050] The structure of the MPEG system stream is 
shown in Fig. 4. As shown in Fig. 4, the MPEG system 
stream is a hierarchical structure of packs and packets 
containing a pack header 41 , packet header 42, and 
payload 43. 

40 [0051] The packet is the smallest multiplexing unit, 
and the pack is the smallest data transfer unit. 
[0052] Each packet comprises a packet header 42 
and payload 43. AV data is divided into segments of an 
appropriate size starting from the beginning of the AV 

45 data stream, and these data segments are stored in the 
payload 43. The packet header 42 contains a stream ID 
for identifying the type of data stored to the payload 43, 
and a time stamp used for reproducing the data con- 
tained in the payload 43. This time stamp is expressed 

50 with 90 kHz precision. Data types identified by the 
stream ID include moving picture and audio. The time 
stamp includes both a decoding time stamp DTS and 
presentation time stamp PTS. The decoding time stamp 
DTS is omitted when decoding and presentation occur 

55 simultaneously, as with audio data. 

[0053] A pack typically contains a plurality of packets. 
In this preferred embodiment of the present, invention, 
however, one pack contains one packet. Thus, one pack 
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comprises pack header 41 and one packet (comprising 
packet header 42 and payload 43) as shown in Fig. 4. 
[0054] The pack header 41 contains a system clock 
reference SCR expressing with 27 MHz precision the 
time at which the data in that pack is input to the decoder s 
buffer. 

[0055] A decoder for decoding the above-noted 
MPEG system stream is described next below. 
[0056] Fig. 5 is a block diagram of a model MPEG sys- 
tem decoder (P_STD), particularly showing the detail of 
decoder 16. Shown in Fig. 5 are: a system controller 51 
with a system time clock STC, an internal reference 
clock for the decoder; a demultiplexer 52 for demulti- 
plexing, that is, decoding, the system stream; a video 
decoder input buffer 53; video decoder 54; a re-ordering 
buffer 55 for temporarily storing I and P pictures in order 
to absorb the delay between the display order and de- 
coding order that occurs between I and P pictures and 
the dependent B pictures; a switch 56 for adjusting the 
output sequence of the I, P, and B pictures in the re- 
ordering buffer 55; an audio decoder input buffer 57; and 
an audio decoder 58. 

[0057] The operation of this MPEG system decoder 
when processing an MPEG system stream is described 
next. 

[0058] When the time indicated by the STC 51 match- 
es the system clock reference SCR recorded in a pack 
header, the corresponding pack must be input to the de- 
multiplexer 52. Note that the STC 51 is initialized to the 
system clock reference SCR at the first pack in the sys- 
tem stream. The demultiplexer 52 then interprets the 
stream ID in the packet header, and transfers the pay- 
load data to the decoder buffer appropriate to each 
stream. The demultiplexer 52 also extracts the presen- 
tation time stamp PTS and decoding time stamp DTS. 
When the time indicated by the STC 51 and the decod- 
ing time stamp DTS match, the video decoder 54 reads 
and decodes the picture data from the video buffer 53. 
If the decoded picture is a B picture, the video decoder 
54 presents the picture. If the decoded picture is an I or 
P picture, the video decoder 54 temporarily stores the 
picture to the re-ordering buffer 55 before presenting the 
picture. 

[0059] The switch 56 corrects the difference between 
the decoding sequence and the presentation sequence 
as described above with reference to Fig. 3. That is, if 
a B picture is output from the video decoder 54, the 
switch 56 is set to pass the video decoder 54 output di- 
rectly from the system decoder. If an I or P picture is 
output from the video decoder 54, the switch 56 is set 
to output the output from the re-ordering buffer 55 from 
the system decoder. 

[0060] It should be noted that I pictures cannot be si- 
multaneously decoded and presented because the pic- 
ture sequence must be reordered to correct the differ- 
ences between the decoding order and the display or- 
der. Even if no B pictures are present in the system 
stream, there is a delay of one picture, that is, one video 



frame period, between picture decoding and presenta- 
tion. 

[0061 ] Similarly to the video decoder 54, the audio de- 
coder 58 also reads and decodes one audio frame of 
data from the audio buffer 57 when the time indicated 
by the STC 51 and the presentation time stamp PTS 
match (note that there is no decoding time stamp DTS 
present in the audio stream). 

[0062] MPEG system stream multiplexing is de- 
scribed next with reference to Fig. 6. Fig. 6 (a) shows 
several video frames, Fig. 6 (b) represents the video 
buffer state, Fig. 6 (c) shows the MPEG system stream, 
and Fig. 6 (d) shows the audio signal (audio data). The 
horizontal axis in each figure represents the time base, 
which is the same in each figure. The vertical axis in Fig. 
6 (b) indicates how much data is stored in the video buff- 
er at any given time; the solid line in Fig. 6 (b) indicates 
the change in the buffered data over time. The slope of 
the solid line corresponds to the video bit rate, and in- 
dicates that data is input to the buffer at a constant rate. 
The drop in buffered data at a regular period indicates 
that the data was decoded. The intersections between 
the diagonal dotted lines and the time base indicate the 
time at which video frame transfer to the video buffer 
starts. 

2. Problems with a conventional MPEG stream 

[0063] Digital cameras using a conventional MPEG 
stream as described above are not believed to be pres- 
ently available as commercial products because of the 
problems described below. For the convenience of the 
following description, however, it is herein assumed that 
this hypothetical digital camera exists. 
[0064] The relationship between the reproduction op- 
eration of an MPEG stream decoder in this hypothetical 
digital camera and the various time stamps (STC, PTS, 
DTS) is described first with reference to Figs. 1 7 and 1 8. 
Note that the decoder is assumed to be comprised as 
shown in Fig. 5. 

[0065] Fig. 17 is used to describe the operation for 
reproducing data captured by the digital camera on a 
personal computer (PC). An exemplary screen present- 
ed on the PC display is shown in Fig. 17 (a). Photo #1 
and photo #2 represent separate image files displayed 
on the screen in the form of icons. In a graphical user 
interface (GUI) such as Windows 95 (R), photos #1 and 
#2 may be presented as thumbnail sketches, for exam- 
ples, which a user can click on using a mouse or other 
pointing device. The PC then presents the file content 
corresponding to the photograph that was clicked on by 
displaying the image on screen and outputting the audio 
from a speaker connected to the PC. Fig. 17 (b) shows 
the content displayed for photo #1 and photo #2 in this 
example. 

[0066] When a user clicks on photo #1 in Fig. 17 (a) 
in this example, still picture #1 is presented on screen, 
and audio #1 is output from the PC speaker, as shown 
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in Fig. 17 (b). Likewise when the user clicks on photo 
#2, still picture #2 is presented on screen, and audio #2 
is output from the PC speaker. 
[0067] The relationship between decoder operation in 
this hypothetical digital camera and the various time 
stamps when photo #1 is reproduced is shown in Fig. 
18 and described below. 

[0068] The video output, still picture #1, and audio 
output, audio #1 , that are output for photo #1 are shown 
in Figs. 18(a) and 18(b). Figs. 18(c) and 18(d) show the 
change in the data stored to the video buffer 53 and au- 
dio buffer 57 as still picture #1 and audio #1 are decoded 
and output. Fig. 18(e) shows the pack sequence and 
time stamps (SCR, PTS, DTS) stored in each pack when 
photo #1 is stored to disc as stream #1, which is an 
MPEG stream in this example. 
[0069] It should be noted that while not shown in the 
figure, the DTS and PTS are stored in the packet header 
of each packet as described above. It will also be obvi- 
ous to one with ordinary skill in the related art that while 
only four video packs and two audio packs are shown 
for simplicity, there are actually more than 100 audio 
packs and video packs each because each pack is a 
maximum 2 KB. 

[0070] The reproduction operation of this hypothetical 
digital camera starts by sending the packs contained in 
stream #1 shown in Fig. 18(e) to the demultiplexer 52. 
[0071] As shown in Fig. 18(e), stream #1 is multi- 
plexed with the packs in the following sequence, starting 
from the beginning of the stream: video pack V1 , video 
pack V2, audio pack A1 , video pack V3, video pack V4, 
audio pack A2. The pack header of each pack contains 
a system clock reference SCR, which indicates the tim- 
ing at which that pack is input to the demultiplexer 52. 
In the example shown in Fig. 1 8, time t1 is stored to sys- 
tem clock reference SCR #1 of video pack V1 , time t2 
is stored to SCR #2 of video pack V2, time t3 is stored 
to SCR #3 of audio pack A1 , time t4 is stored to SCR 
#4 of video pack V3, time t5 is stored to SCR #5 of video 
pack V4, and time t6 is stored to SCR #6 of audio pack 
A2. 

[0072] The PTS and DTS are also written to the first 
pack of each picture. Time t7 is thus written to PTS #1 
of video pack V1 , and time t6 is written to DTS #1 . Note 
that the PTS and DTS are the same for every video pack 
in a picture, and are therefore not written to any but the 
first video pack. 

[0073] The PTS is written to every audio pack. There- 
fore, time t7 is written to PTS #1 for audio pack A1 , and 
time t9 is written to PTS #2 for audio pack A2. Note, 
further, that the PTS is written and the DTS is omitted 
in audio packs because the PTS and DTS are the same 
in an audio pack. The STC is reset at time t1 , the value 
of SCR #1 in video pack V1, that is, the first pack in 
stream #1 , and each pack in the stream #1 is then input 
to the demultiplexer 52 at the indicated by the SCR value 
in the pack header. 

[0074] Therefore, as shown in Fig. 18(e), video pack 



V1 is input to the demultiplexer 52 first at time t1 , then 
video pack V2 is input at time t2, audio pack A1 is input 
at time t3, video pack V3 is input at time t4, video pack 
V4 is input at time t5, and audio pack A2 is input at time 
5 tB. Video packs input to the demultiplexer 52 are then 
output to the video buffer 53, and audio packs are output 
to the audio buffer 57. 

[0075] The second part of the reproduction operation 
of this hypothetical digital camera described below is the 
10 data decoding and output operation of the video packs 
output to the video buffer 53. 

[0076] As shown in Fig. 18(c), while there is an ignor- 
able delay between the video packs output from the de- 
multiplexer 52, the video packs are accumulated to the 

15 video buffer 53 at the system clock reference SCR tim- 
ing, that is, at time t1 , t2, t4, and t5. Still picture #1 com- 
prises video packs V1 to V4. As a result, all video packs 
constituting still picture #1 have been stored to the video 
buffer 53 once video pack V4 has been stored to the 

20 video buffer 53. As shown in Fig. 18(e), the decoding 
time stamp DTS of still picture #1 comprising video 
packs V1 to V4 is time t6. The data accumulated to the 
video buffer 53 is therefore decoded by video decoder 
54 at time t6, and the data is cleared from the video buff- 

25 er, thereby increasing the available buffer capacity. 
[0077] The decoded video pack data of still picture #1 
is an I picture. The decoded I picture is stored to re-or- 
dering buffer 55, and is output from the decoder at PTS 
time t7. 

30 [0078] Note that the end presentation time for still pic- 
ture #1 is not defined by an MPEG stream time stamp. 
As a result, presentation typically ends when reproduc- 
tion of the next MPEG stream begins, or when video out- 
put is terminated by a control command sent to the de- 

35 coder from another application or device. The example 
shown in Fig. 18 therefore shows presentation of still 
picture #1 continuing even after time t10, the time at 
which audio output ends. 

[0079] The third part of the reproduction operation of 
40 this hypothetical digital camera described below is the 
relationship between the time stamps and the operation 
whereby audio pack data output to the audio buffer 57 
is decoded and output. 

[0080] As shown in Fig. 1 8 (d), the audio packs output 
45 from the demultiplexer 52 are stored to the audio buffer 
57 at time t3 and t8, thus increasing the amount of data 
stored to the audio buffer 57. Unlike the video data, the 
PTS and DTS are the same in the audio data. As a re- 
sult, audio data is output at the same time the audio de- 
50 coder 58 [57, sic, and below] decodes the audio pack 
data. More specifically, the audio pack A1 data stored 
to audio buffer 57 is decoded by audio decoder 58 at 
the presentation time stamp PTS, i.e., time t7, and audio 
output begins. The audio pack A2 data stored to the au- 
55 dio buffer 57 at time t8 is then decoded and output at 
the PTS, that is, time t9, by audio decoder 5B. 
[0081] The time that data can be stored to each de- 
coder buffer is also limited in the MPEG system. This 



13 



EP0 949 825 B1 



14 



limit is 1 sec. in the case of moving picture data. This 
means that the maximum difference between the trans- 
fer times of simultaneously output audio and video data, 
that is, the maximum SCR difference, is 1 second. How- 
ever, a delay equal to the time required to reorder the 
video data may also occur. 

3. MPEG stream problems 

[0082] Through years of research and development, 
the inventors have identified and organized problems 
presented by the conventional MPEG stream described 
above with respect to using the MPEG stream in a digital 
still picture camera. 

[0083] As noted above, the MPEG system stream 
contains video data and the audio data presented with 
that video data multiplexed into a single system stream. 
Editing this system stream to change the audio present- 
ed with a particular video image is therefore difficult 
once the audio and video streams have been multi- 
plexed into a single system stream. This means that 
when a digital camera uses an MPEG stream to encode 
and store a still picture and the audio recorded when 
that picture was taken to a recording medium, it is diffi- 
cult to later edit the audio to replace the audio recorded 
when the picture was taken with a different audio signal. 
[0084] Referring to the example shown in Fig. 17, 
when photo #1 is captured by a digital still picture cam- 
era, photo #1 is recorded by the camera to a disc or other 
recording medium as an MPEG stream multiplexing still 
picture #1, that is, the still picture data, and audio #1, 
that is, the audio data captured at the same time. The 
resulting MPEG stream thus comprises multiplexed vid- 
eo packs and audio packs as shown in Fig. 18(e). As a 
result, after the user takes a picture, it is difficult to 
change the audio data of photo #1 from audio #1 to a 
different audio signal. 

[0085] Though difficult, the following three methods of 
editing the audio data after recording are conceivable. 

(1) Generate a plurality of MPEG streams, each 
multiplexing the video data (the photographed still 
picture data) with one of a plurality of audio data 
streams that might be used with the video data, and 
record all of these plural MPEG streams to the re- 
cording medium. This method means that in the ex- 
ample shown in Fig. 1 8, a number of other streams, 
each containing the same video packs but a differ- 
ent selection of audio packs, is recorded in addition 
to stream #1 shown in Fig. 18(e). There is a limit to 
the number of MPEG streams that can be recorded, 
however, because the storage capacity of the re- 
cording medium is also limited. More particularly, 
however, it is not practically possible for the user to 
record at the time the picture is taken all audio data 
that might be desirably combined with the still pic- 
ture. 

(2) Decode the MPEG stream during editing to sep- 



arate the still picture data from the audio data, and 
then re-encode the system stream with the still pic- 
ture data and the new audio data. This method, 
however, requires system stream decoding and en- 

5 coding each time the audio is edited, thus increas- 
ing the required editing time. The entire system 
stream is also stored as decoded (uncompressed) 
data, thus requiring a large amount of memory in 
the digital camera. 

10 (3) Record the video stream and audio stream as 
two separate (unmultiplexed) streams, and deter- 
mine what audio stream to use with a particular vid- 
eo stream at the time of reproduction. This method 
makes it possible to add audio data after recording 

is a still picture to the recording medium, and then re- 
produce the added audio data when reproducing 
the still picture. 

[0086] The inventors of the present invention have 
20 used the above method (3). More specifically, the 
present invention provides a method and apparatus for 
reproducing two MPEG streams stored separately on 
disc as though they are a single MPEG stream using a 
conventional MPEG decoder. 

25 

MPEG stream according to the present invention 

[0087] To achieve the present invention by using a 
conventional decoder to reproduce two separate MPEG 
30 streams, one for still picture data and one for audio data, 
as noted above, it is necessary to drive the decoder to 
process the two MPEG streams as a single system 
stream. 

[0088] The first problem to be overcome in processing 
35 two MPEG streams as though they are a single system 
stream is that a discrete time stamp is assigned to the 
two streams. When the two streams are processed con- 
tinuously as one stream, contradictions such as a dis- 
continuity between the time stamps assigned to the two 
40 streams can occur. 

[0089] While the time stamps in the MPEG stream are 
multiplexed into the data, the initial time stamp value 
(the first system clock reference SCR in the stream) in 
a normal MPEG stream is not defined by the MPEG 
45 standard. In practice, therefore, the encoder assigns a 
specific value. 

[0090] It will therefore be obvious that there is no con- 
tinuity or correlation between the time stamps assigned 
to MPEG streams generated by different encoders. As- 

50 sume, for example, that encoder A generates an MPEG 
stream A encoded with an initial SCR of 0, and an en- 
coder B generates an MPEG stream B encoded with an 
initial SCR of 1000. The SCR of the last pack in stream 
A is 27000000 (27 MHz). Here, (27 MHz) indicates that 

55 the number preceding (27 MHz) is a counted value using 
27 MHz clock. Streams A and B are to be continuously 
processed by the decoder as a single stream. A discon- 
tinuity occurs in the SCR in this case between the end 
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of stream A and the beginning of stream B, and there is 
a strong possibility that the decoder hangs up or other 
error occurs. 

[0091] To resolve this problem, a recording apparatus 
according to the present invention limits the values of 
the time stamps (SCR, PTS, DTS) in the system 
streams that are generated and recorded to disc. 
[0092] The MPEG stream time code limits imposed 
by the present invention are described next below. 
[0093] Fig. 1 1 is referred to below to describe the time 
stamps used for the still picture data system stream ST1 
and the audio data system stream ST2 in this preferred 
embodiment of the present invention. 
[0094] Fig. 11(a) shows the structure of the system 
stream for still picture data, referred to as a video object 
(VOB). System clock reference SCR1 is written to the 
pack header of the first pack in the VOB, and PTS1 and 
DTS1 are written to the packet header of the first VOB. 
SCR2 is written to the pack header of the last pack. 
[0095] Fig. 11(b) shows the VOB for the audio data 
system stream ST2. SCR3 is written to the pack header 
of the first pack in this VOB, and PTS3 is written to the 
packet header 

[0096] Fig. 11 (c) shows the sequence in which the 
still picture data and audio data system streams are in- 
put continuously to the decoder during reproduction. 
[0097] In order to drive the decoder to process the still 
picture data system stream ST1 and audio data system 
stream ST2 as a single system stream, the values as- 
signed to the system clock reference SCR2 in the last 
pack ol the still picture data system stream ST1 , and the 
system clock reference SCR3 in the first pack of the au- 
dio data system-stream ST2, are limited as defined by 
equation (1) below in the present invention. 

SCR2+Tp^SCR3 (1) 

where Tp is the time required to transfer one pack to the 
decoder. More specifically, Tp is a time period from the 
moment when one pack starts to enter the demultiplexer 
52 until said one pack completely enters the demulti- 
plexer 52. Since the pack merely passes through the 
demultiplexer 52, it is also possible to say that Tp is a 
time period from the moment when one pack starts to 
enter the buffer 53 (or 57) unit said one pack completely 
enters the buffer 53 (or 57). 

[0098] It should be noted that equation (1) limits the 
smallest value that can be assigned to SCR3. SCR3 is 
often set to zero (0) in a conventional MPEG stream. A 
recording apparatus according to the present invention, 
however, calculates the SCR3 value from equation (1). 
[0099] By thus calculating the value of SCR3, SCR2 
is prevented from being greater than SCR3, and the 
SCR values in each pack of the still picture data system 
stream ST1 and audio data system stream ST2 are as- 
sured of being in a rising numerical sequence from one 
system stream to the next. 



[0100] Equation (1) also assures that the difference 
between SCR2 and SCR3 is at least Tp. This prevents 
the transfer timing of the first pack in the- audio data 
system stream ST2 from conflicting with the transfer of 
5 the last pack in the still picture data system stream ST1 , 
that is, transferring the first pack in the audio data sys- 
tem stream ST2 will not start while the last pack in the 
still picture data system stream ST1 is still being trans- 
ferred. 

10 [0101] It should be further noted that if the system 
stream transfer rate is 8 Mbps, the pack transfer time 
TP will be 55296 (27 MHz); if 10.08 Mbps, the pack 
transfer time Tp will be 43885 (27 MHz). 
[0102] A decoder according to the present invention 

is is further comprised to accept input of the audio data 
system stream ST2 following a still picture data system 
stream ST1 without first resetting the STC after input 
thereto of a still picture data MPEG stream has been 
completed. This is because it would be meaningless to 

20 limit the value of the SCR in the first audio stream pack 
if the decoder resets the STC after still picture data input, 
as it conventionally does after each system stream. 
[0103] By thus driving the decoder to process sup- 
plied system streams based on time stamp values cal- 

25 culated as described above, the decoder can handle 
separate stilt picture data and audio data system 
streams as a single MPEG stream. That is, a still picture 
data stream and a separately recorded audio data 
stream can be reproduced as though they are a single 

30 system stream. 

[0104] The presentation time stamps PTS1 and PTS3 
are also set to the same specified value as shown in 
equation (2) below. 

35 

PTS1 = PTS3 = specified value (2) 

[0105] This assures that both audio and still picture 
data output begin at the same time. 
40 [0106] In this exemplary embodiment of the present 
invention, this specified value is 

90000 (90 kHz) + Tv 

45 

where Tv is the video frame period, and (90 kHz) indi- 
cates that the number preceding (90 kHz) is a counted 
value using 90 kHz clock. In an NTSC signal, Tv is there- 
fore 3003; in a PAL signal, it is 3600. 

50 [0107] The time stamps shown in Fig. 11 are de- 
scribed more specifically below with reference to a case 
in which still picture data and audio output begin simul- 
taneously at approximately 1 second (90000 (90 kHz) + 
Tv) after data reading based on the time stamps calcu- 

55 lated from the above equations (1 ) and (2). 

[01 08] The time stamp for the still picture data VOB is 
described first. 
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(1 ) The system clock reference SCR (SCR1 ) for the 
first pack in the still picture data VOB is 0 (27 MHz). 

(2) The decoding time stamp DTS (DTS1) for the 
first pack in the still picture data VOB is 90000 (90 
kHz). Note that a still picture data VOB contains only 
one still picture. 

(3) The presentation time stamp PTS (PTS1 ) for the 
first pack in the still picture data VOB is 93003 (90 
kHz). Note that PTS1 = 93003 is for an NTSC video 
signal; for a PAL video signal, PTS1 = 93600. This 
is because the video frame period (Tv) in an NTSC 
signal is 3003, and is 3600 in a PAL signal. Note, 
further, that because a still picture data VOB con- 
tains only one still picture, all packs are output si- 
multaneously at the time indicated by PTS1 . 

(4) The SCR (SCR2) of the last pack in the still pic- 
ture data VOB is set to a value 27000000 (27 MHz) 
minus the transfer time of one pack (Tp). 

[01 09] The value 27000000 (27 MHz) is called a base 
value below. 

[01 1 0] This base value is set so that the longest delay 
between when moving picture data is input to the de- 
coder buffer and when it is decoded is 1 second 
(27000000 (27 MHz)). 

[0111] More specifically, if the maximum moving pic- 
ture data storage time is applied to still picture data, ail 
packs in the still picture data VOB must be transferred 
to the decoder within 1 second (27000000 (27 MHz)). If 
SCR1 for the first pack is 0, the data stored in the first 
pack will be decoded within 1 second (27000000 (27 
MHz)) after it is transferred to the decoder, and the SCR 
(SCR2) of the last pack in the same still picture data 
VOB is therefore pack transfer time Tp less than 
27000000 (27 MHz). 

[01 1 2] The PTS value and this base value are defined 
as described above to ensure encoder compatibility. In 
other words, if the still picture data system stream ST1 
and audio data system stream ST2 are encoded using 
the values derived from equations (1 ) and (2), the above 
specified PTS value, and the above base value, the 
present invention can be applied regardless of what en- 
coder generates the system streams. 
[0113] It should be noted that the base value is de- 
fined in this preferred embodiment as 27000000 (27 
MHz). The following equations (3) and (4) can therefore 
be derived where this base value is MaxT 

SCR2 + Tp < MaxT (3) 



SCR3=MaxT (4) 

[01 1 4] The time stamps of the audio data VOB are de- 
scribed next. 

(1 ) The system clock reference SCR (SCR3) of the 



first audio pack is 27000000 (27 MHz). Using this 
value, the audio pack will be input to the decoder 
continuously to the preceding still picture data VOB 
at the shortest time satisfying equation (1 ). In addi- 

5 tion, because the still picture data PTS1 is 93003 
(90 kHz), the SCR must be set to a smaller value in 
order to simultaneously output the audio. 
(2) The presentation time stamp PTS (PTS3) of the 
first audio frame in the VOB is 93003 (90 kHz). As 

10 noted above, this is for an NTSC video signal; if PAL 
video, PTS3 is 93600. 

[0115] It will also be obvious to one with ordinary skill 
in the related art that insofar as the still picture data VOB 

*5 and audio data VOB are encoded to satisfy equations 
(1 ) and (2), the present invention shall not be limited to 
the conditions (values) described above. 
[011 6] For example, if the video is an NTSC signal and 
the first SCR is 27000000 (27 MHz) rather than 0, the 

20 following values apply. 

SCR1 = 27000000 (= 1 sec) 



SCR2 < 53944704 (= SCR3 - Tp) 



SCR3 = 54000000 (= SCR1 + 1 sec) 

30 

PTS1 = PTS3 = 183003 (= DTS1 + 3003) 



DTS1 = 180000 (= 1 sec) 

[011 7] If the video is an NTSC signal, the first SCR is 
0, and PTS is 1 second, the following values apply. 

SCR1 =0 



SCR2 <; 26043804 (= SCR3 - Tp) 

45 

SCR3 = 26099100 (= 1 sec - 3003 x 300) 



PTS1 = PTS3 = 90000 (= 1 sec) 



DTS1 = 86997 (= PTS1 - 3003) 

55 [011 8] If the video is a PAL signal and the first SCR is 
27000000 (27 MHz), the following values apply. 



10 



19 



EP 0 949 825 B1 



20 



SCR1 = 27000000 (= 1 sec) 
SCR2 <; 53944704 (= SCR3 - Tp) 
SCR3 = 54000000 (= SCR1 + 1 sec) 
PTS1 = PTS3 = 183600 (= DTS1 + 3600) 

DTS1 = 180000 (=1 sec) 

[0119] If the video is a PAL signal, the first SCR is 0, 
and PTS is 1 second, the following values apply. 

SCR1 = 0 
SCR2 < 25864704 (= SCR3 - Tp) 
SCR3 = 25920000 (= 1 sec - 3600 x 300) 
PTS1 = PTS3 = 90000 (= 1 sec) 

DTS1 = 86400 (= PTS1 - 3600) 

[01 20] If the transfer rate is 1 0.08 Mops, and the video 
is an NTSC signal, the following values apply. 

SCR1 = 0 

SCR2 < 269561 15 (= SCR3 - Tp (=43885)) 
SCR3 = 27000000 (= 1 sec) 
PTS1 = PTS3 = 93003 (= DTS1 + 3003) 

DTS1 = 90000 (= 1 sec) 

[0121] If the transfer rate is 10.08 Mbps, and the video 
is a PAL signal, the following values apply. 

SCR1 = 0 



SCR2 < 269561 15 (= SCR3 - Tp (= 43885)) 



SCR3 = 27000000 (= 1 sec) 



PTS1 = PTS3 = 93600 (= DTS1 + 3600) 



DTS1 = 90000 (= 1 sec) 

[0122] An operation whereby an MPEG stream hav- 
ing time stamps defined as described above is proc- 

15 essed by an exemplary decoder is described next below 
with reference to Fig. 19 and Fig. 20. Note that this de- 
coder is comprised as shown in Fig. 5. 
[0123] Like Fig. 18, Fig. 19 shows the relationship be- 
tween decoder operation in digital still camera according 

20 to the present invention and the various time stamps 
when photo #1 is reproduced. 

[0124] The video output, still picture #1, and audio 
output, audio #1 , that are output for photo #1 are shown 
in Figs. 1 9(a) and 1 9(b). Figs. 1 9(c) and 1 9(d) show the 

25 change in the data stored to the video buffer 53 and au- 
dio buffer 57 as still picture #1 and audio #1 are decoded 
and output. Fig. 19(e) shows the pack sequence and the 
time stamps (SCR, PTS, DTS) written to each pack of 
the video stream #1 and audio stream #1 , both of which 

30 are MPEG streams, when photo #1 is stored to disc as 
two streams #1 and #2. 

[0125] It should be noted that the packet structure and 
further description thereof are omitted here as in Fig. 1 8. 
[0126] The first part of the description of the reproduc- 
es tion operation of a digital still camera according to the 
present invention starts with the operation for transfer- 
ring the packs of streams #1 and #2 shown in Fig. 1 9(e) 
to the demultiplexer 52. 

[0127] As shown in Fig. 19(e), stream #1 comprises 

40 video pack V1 , video pack V2, video pack V3, and video 
pack V4 multiplexed in sequence from the beginning of 
the stream. Stream #2 likewise comprises audio pack 
A1 and audio pack A2 multiplexed in sequence starting 
from the beginning of the stream. It is important to note 

45 here that stream #1 comprises only video packs, and 
stream #2 comprises only audio packs. 
[01 28] The pack header of each pack also contains a 
system clock reference SCR. As shown in Fig. 19(e), 
SCR#1 of video pack V1 in stream #1 is time t1 ; SCR#2 

50 of video pack V2 is time t2; SCR#3 of video pack V3 is 
time t3; and SCR#4 of video pack V4 is time t4. The 
presentation time stamp PTS and decoding time stamp 
DTS are also set in the first video pack in the video 
stream V1. PTS#1 in video pack V1 is time t8, and 

55 DTS#1 is time t6. 

[0129] In this preferred embodiment as described 
above, the value of time t1 , that is, the value of SCR#1 
in the first video pack V1 , is 0. The value of SCR#4 in 
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the last video pack V4 is likewise 27000000 (27 MHz) - 
Tp t where Tp is the pack transfer time described above 
and is 55296 (27 MHz). Assuming that the video data is 
NTSC coded, time t8 of PTS#1 is 93003 (90 kHz), and 
time t6 of DTS#1 is 90000 (90 kHz). 
[0130] The system clock reference SCR#5 of the first 
audio pack A1 in stream #2 is time t7, and SCR#6 of 
audio pack A2 is time t9. A presentation time stamp PTS 
is also set in audio packs A1 and A2 PTS#5 in audio 
pack A1 is time t8, and PTS#6 in audio pack A2 is time 

no. 

[0131] In this preferred embodiment as described 
above, the value of time t7, that is, the value of SCR#5 
in the first audio pack A1, is 27000000 (27 MHz). Time 
t8 of PTS#5 in audio pack A1 is the same as the video 
data PTS, that is, 93003 (90 kHz). 
[0132] The system time clock STC is reset to time t1 , 
the value of SCR#1 in the first video pack V1 , and each 
pack in stream #1 is then input to the demultiplexer 52 
at the time indicated by the SCR of each pack. 
[0133] That is, as shown in Fig. 19(e), the first video 
pack V1 is input to the demultiplexer 52 at time t1 , video 
pack V2 is input at time t2, video pack V3 at time t3, and 
video pack V4 at time t4. 

[0134] The decoding process of a digital still camera 
according to the present invention differs from a conven- 
tional camera as described with reference to Fig. 18 in 
that the system time clock STC of the decoder is not 
reset after all of stream #1 is input, and the packs of 
stream #2 are input continuously to the demultiplexer 
52 at the SCR timing written to each pack. 
[0135] The first audio pack A1 in stream #2 is thus 
input to the demultiplexer 52 at time t7, and audio pack 
A2 is input at time t9. 

[0136] It is important to note here that the SCR#4 of 
the last video pack V4 and the SCR#5 of the first audio 
pack A1 are set to satisfy equation (1 ) above, which can 
thus be restated as follows. 

SCR#4 + Tp < SCR#5 (1) 

[01 37] Continuity between the SCR values of stream 
#1 and stream #2 is thus assured, the interval therebe- 
tween is at least equal to the pack transfer time, and the 
decoder can thus continuously process two streams 
without hanging up. 

[0138] The demultiplexer 52 outputs video packs in- 
put thereto to the video buffer 53, and outputs audio 
packs input thereto to the audio buffer 57. 
[01 39] The second part of the reproduction operation 
of a digital camera according to the present invention 
described below is the data decoding and output oper- 
ation of the video packs output to the video buffer 53. 
[0140] As shown in Fig. 19(c), while there is an ignor- 
able delay between the video packs output from the de- 
multiplexer 52, the video packs are accumulated to the 
video buffer 53 at the SCR timing, that is, at time t1 , t2, 



t3, and t4. Still picture #1 comprises video packs V1 to 
V4. As a result, all video packs constituting still picture 
#1 have been stored to the video buffer 53 once video 
pack V4 has been stored to the video buffer 53. As 

5 shown in Fig. 19(e), the decoding time stamp DTS of 
still picture #1 comprising video packs V1 to V4 is time 
t6. The data accumulated to the video buffer 53 is there- 
fore decoded by video decoder 54 at time t6, and the 
data is cleared from the video buffer, thereby increasing 

10 the available buffer capacity. 

[01 41] The decoded video pack data of still picture #1 
is an I picture. The decoded I picture is stored to re-or- 
dering buffer 55, and is output from the decoder at PTS 
time t8. 

15 [0142] The third part of the reproduction operation of 
a digital camera according to the present invention de- 
scribed below is the relationship between the time 
stamps and the operation whereby audio pack data out- 
put to the audio buffer 57 is decoded and output. 

20 [0143] As shown in Fig. 1 9(d), the audio packs output 
from the demultiplexer 52 are stored to the audio buffer 
57 at time t7 and t9, thus increasing the amount of data 
stored to the audio buffer 57. Unlike the video data, the 
PTS and DTS are the same in the audio data. As a re- 

25 suit, audio data is output at the same time the audio de- 
coder 58 [57, sic, and below] decodes the audio pack 
data. More specifically, the audio pack A1 data stored 
to audio buffer 57 is decoded by audio decoder 58 at 
the presentation time stamp PTS, i.e. , time t8, and audio 

30 output begins. The audio pack A2 data stored to the au- 
dio buffer 57 at time t9 is then decoded and output at 
the PTS, that is, time t10, by audio decoder 58. 
[0144] It is important to note here that the PTS is the 
same in the still picture data stream #1 and the audio 

35 data stream #2. As a result, stream #1 and stream #2 
are input to the decoder at different times, but are output 
at the same time, which is determined by the PTS. 
[0145] It will therefore be obvious that insofar as the 
time stamps are within the limits defined above, an 

40 MPEG stream comprising only still picture data, and an 
MPEG stream comprising only audio data, can be proc- 
essed continuously, one following the other, by a decod- 
er, with the audio and video presentation occurring si- 
multaneously. 

45 [0146] It will also be obvious that by recording the still 
picture data MPEG stream and audio data MPEG 
stream separately to disk, the audio to be reproduced 
with a particular still picture can be freely and easily 
changed and edited after the still picture data is captured 

so and recorded. 

[0147] Assume, for example, that still picture #1 and 
audio #1 described above with reference to Fig. 19 are 
the data recorded to disk when the image was photo- 
graphed. To later change the audio that is to be output 

55 simultaneously with the still picture #1 , it is only neces- 
sary to record an MPEG stream encoded with time 
stamps derived from equations (1) and (2). An example 
of this new audio #2 additionally recorded as MPEG 
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stream #3 is shown in Fig. 20, 
[0148] Though not shown in the figures, management 
information indicating what audio data MPEG stream is 
to be reproduced simultaneously with the MPEG stream 
for still picture #1 is also recorded to disk. This manage- 
ment information can then be updated so that the MPEG 
stream for audio #2 is reproduced simultaneously with 
the MPEG stream for still picture #1 instead of the 
MPEG stream for audio #1. 

DVD-RAM description 

[0149] DVD-RAM is described next below as a re- 
cording medium and recording format suitable for re- 
cording MPEG streams as described above. 
[0150] Advances in high density recording technolo- 
gies for rewritable optical discs have expanded their 
range of applications from computer data and music to 
image data. A typical conventional optical disc has a 
guide channel of either lands or grooves formed on the 
signal recording surface of the disc. This has meant that 
signals are recorded using only the lands or the grooves. 
The development of a land and groove recording meth- 
od, however, has enabled signals to be recorded to both 
lands and grooves, thus approximately doubling the re- 
cording density of the disc. 

[0151] Constant linear velocity (CLV) control is also 
an effective means of improving recording density, and 
the development of a zoned CLV control method has 
made it easier to implement CLV control. 
[0152] How to utilize these high capacity optical discs 
to record AVdata, including video and other image data, 
and achieve new products with features and functions 
far surpassing those of conventional AV products is a 
major concern for the industry. 
[0153] It is also thought that the availability of large 
capacity, rewritable optical disc media will result in the 
primary medium for recording and reproducing AV ma- 
terials changing from conventional tape media to optical 
disc media. The change from tape to disc media will also 
have wide-ranging effects on the functions and perform- 
ance of AV equipment. 

[01 54] One of the greatest benefits to be gained from 
a switch from tape to disc media is a significant increase 
in random access performance. While it is possible to 
randomly access tape media, several minutes may be 
required to access a particular part of the tape due to 
the need to fast-forward andlor rewind in a linear fash- 
ion. When compared with the seek time of optical disc 
media, which is typically on the order of several 10 mil- 
liseconds, there is an obvious and significant improve- 
ment in random access performance achieved by a 
switch to disc media. Tape is therefore obviously unsuit- 
able as a random access medium. 
[0155] Random access also means that distributed 
(that is, non -contiguous) recording of AV material is pos- 
sible with optical disc media, though impossible with 
conventional tape media. 



Logic structure of DVD-RAM media 

[0156] The logic structure of DVD-RAM media is de- 
scribed next below with reference to Fig. 8. Fig. 8(a) 
5 shows the directory file and the organization of the disc 
recording area. 

[0157] The recording area of the optical disc is ar- 
ranged into a plurality of physical sectors in a spiral pat- 
tern from the inside circumference to the outside circum- 

10 ference of the disc. 

[01 58] The physical sectors of the disc are further al- 
located to one of three areas from the inside circumfer- 
ence to the outside circumference of the disc. A lead-in 
area is located at the inside circumference area of the 

15 disc. A lead-out area is located at the outside circumfer- 
ence area of the disc. A data area is provided between 
the lead-in and lead-out areas. 
[01 59] Each sector also has an address segment and 
a data segment The address segment stores address 

20 information specifying the location of that sector on the 
optical disc, and an identifier identifying whether the 
sector is in the lead-in, data, or lead-out area. Digital 
data is stored to the data segment. 
[01 60] The data segment of sectors in the lead-in area 

25 contains information for initializing the device used to 
reproduce data from the disc (reproduction device). This 
information typically includes a reference signal re- 
quired for servo stabilization, and an ID signal for differ- 
entiating one disc from another. 

30 [0161] The data segment of sectors in the data area 
records the digital data constituting the application [? or 
user data ?] stored to the disc. 
[0162] The lead-out area identifies the end of the re- 
cording area for the reproduction device. 

35 [0163] Management information for managing disc 
content and constituting the file system is recorded to 
the beginning of the data area. This management infor- 
mation is the volume information. The file system is a 
table of contents for grouping a plurality of disc sectors 

40 into groups, and managing these disc sector groups. A 
DVD-RAM medium according to the present invention 
preferably uses the file system defined in ISO 1 3346. 
[01 64] An optical disc according to this preferred em- 
bodiment has a file directory structured as shown in Fig. 

45 8(a). 

[0165] All data handled by a DVD recording apparatus 
belongs to the VIDEO_RT directory directly under the 
ROOT directory. 

[0166] There are two basic file types handled by a 
50 DVD recording apparatus: a single management infor- 
mation file, and at least one, though typically plural, AV 
files. 

Management information file 

55 

[0167] The content of the management information 
file is described next with reference to Fig. 9 (a). 
[0168] The management information file contains a 
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VOB (video object) table and a PGC (program chain) 
table. A VOB is an MPEG program stream. The program 
chain defines the reproduction order of individual cells. 
A cell is a logic unit for reproduction, and corresponds 
to a particular part or all of a VOB. In other words, a VOB 
is a meaningful unit in an MPEG stream, and the PGC 
is unit reproduced by an MPEG stream reproducing ap- 
paratus. 

[0169] The VOB table records the number of VOBs 
(Number_of_VOBs), and certain information about 
each VOB. This VOB information includes: the name of 
the corresponding AV file (AV_File_Name); the VOB 
identifier (VOBJD); the start address in the AV file 
(VOB_Start_Address); the end start address in the AV 
file (VOB_End_Address); the VOB playback time 
(VOB_P!ayback_Tlme); and stream attributes 
(VOB_ Attribute). 

[0170] The PGC table records the number of PGCs 
(Number_of_PGCs) [Number_of_VOBs, sic], and cer- 
tain information about each PGC. This PGC information 
includes: the number of cells in the PGC 
(Number_of_Cells), and certain cell information. 
[0171] This cell information includes: the correspond- 
ing VOBJD; the cell start time in the VOB 
(Cell_Start_Time); the cell playback time in the VOB 
(CelLPIayback_Time); the address at which cell play- 
back starts in the VOB (Cell_Start_Address) and the ad- 
dress at which cell playback ends (Cell_End_Address); 
an audio flag indicating that there is audio to be repro- 
duced simultaneously with the still picture data 
(Audio_Flag). When Audio_Flag is set to 1, cell exten- 
sion information exists for the related audio data, that 
is, the VOBJD, Cell_Start_Time, CelLPlaybackJTime, 
Cell_Start_Address, and Cell_End_Address. When 
Audio_Flag is reset to 0, cell extension information for 
the related audio data does not exist. 
[0172] It is important here to note the audio flag 
(Audio_Flag), which is used to declare whether or not 
there is audio data to be output simultaneously with the 
still picture. 

AVfile 

[01 73] The AV file structure is described next with ref- 
erence to Fig. 9 (b). 

[01 74] An AV file has at least one, and typically plural, 
VOBs. VOBs are recorded continuously to disc, and 
VOBs associated with a particular AV file are arranged 
contiguously on disc. VOBs in an AV file are managed 
using the VOB info in the management information file. 
When the DVD reproducing apparatus first accesses 
the management information file, it reads the VOB start 
and end addresses, and is thus able to access the VOB. 
[01 75] The logical reproduction unit of the VOB is the 
cell. A cell is a part of the VOB to be reproduced; it may 
correspond to the entire VOB, and can be set as desired 
by the user. These cells make editing simple without ac- 
tually manipulating the AV data. As with a VOB, cell ac- 



cess is managed using the cell information in the man- 
agement information file. A DVD reproducing apparatus 
thus accesses the management information file to read 
the cell start and end address information in order to 

5 access a cell. 

[0176] Cell address information is referenced to the 
VOB, and VOB address information is referenced to the 
AV file. As a result, the DVD reproducing apparatus ac- 
cesses a cell by adding the cell address information to 

10 the VOB address information to calculate the address 
in the AVfile, enabling the DVD reproducing apparatus 
to access the AV file. 

Links between still picture data and audio data 

15 

[01 77] How a still picture and audio are synchronously 
reproduced is described next with reference to Fig. 10. 
[0178] Fig. 10 (a) shows part of the management in- 
formation file described above. As shown in Fig. 10 (a), 

20 cell information for a still picture contains access infor- 
mation (VOBJD, Cell_Start_Time, 
CelLPIaybackJTime, CelLStart_Address, and 
Cell JEnd_Address) for the still picture data and the cor- 
responding audio data. 

25 [01 79] The audio flag (Audio J^ag) declares whether 
there is audio data to be reproduced with the still picture 
data. Therefore, when the audio flag indicates that there 
is audio data to be reproduced with the still picture data, 
the cell also contains access information for the audio 

30 data VOB. 

[0180] A relationship between still picture data and 
audio data is thus established by setting the audio flag 
(Audio_Flag) and declaring the VOB information for the 
audio data. 

35 [0181] Fig. 10(b) shows an AVfile for still picture data 
and audio data. Data stored in a VOB is either still pic- 
ture data or audio data. There is no VOB that contains 
both still picture data and audio data in a multiplexed 
manner. Unlike moving picture data VOBs, in the 

40 present invention, still picture data VOBs comprise only 
a single I picture video frame; an intraf rame compressed 
video image, and audio data VOBs contain only audio 
data. The still picture data and audio data playback con- 
trol information is generated by referring to the cell in- 

45 formation for the still picture data VOBs and audio data 
VOBs, and defining the still picture cell playback order 
from the PGC. 

[0182] It is therefore possible to freely combine still 
picture data and audio data streams by defining the 

so playback order of referenced cells for separately record- 
ed still picture data and audio data. 
[01 83] It should be noted that while this preferred em- 
bodiment has been described as having two VOBs for 
one MPEG stream; one for the video data and the other 

55 for the audio data, the data structure is not limited as 
such as long as the audio data and the video data can 
be separated, and the separated audio data can be re- 
placed with another audio data. 
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[0184] For example, the video data (video stream 
part) and the audio data (audio stream part) can be in- 
corporated in a single VOB. Such an example is shown 
in Fig. 10(c). In this case, the video data ot the still pic- 
ture is stored in the video part, which is located in the 
leading half portion of the VOB, and the audio data is 
stored in the audio part, which is located in the trailing 
half portion of the VOB. Fig. 10(c) shows RTR_STO. 
VROfile, such as shown in Fig. 8(b). 
[0185] It is noted that the first system stream ST1 
shown in Fig. 11 and the video part shown in Fig. 10(c) 
are generally referred to as a video part stream. Simi- 
larly, the second system stream ST2 shown in Fig. 12 
and the audio part shown in Fig. 10(c) are generally re- 
ferred to as an audio part stream. 
[0186] The file structure may also be as shown in Fig. 
8(b). In this case, the VIDEO_RT directory corresponds 
to the DVD_RTR directory, and RTR.IFO, RTR_STO. 
VRO, RTR_STA.VRO, and RTR_MOV.VRO files are 
under the DVD_RTR directory. 
[01 87] The RTR. IFO file corresponds to the manage- 
ment information file. The RTR_STO.VRO and 
RTR_STA.VRO files are related to the still picture data. 
The RTR_STO.VRO file records the still picture data 
(video part) and the audio data (audio part) simultane- 
ously recorded with the still picture data. The RTR_STA. 
VRO file records only the audio data (audio part) edited 
after initial recording. Audio data in the RTR_STA.VRO 
file is recorded with a relationship to still picture data 
recorded in the RTR_STO. VRO file. Moving picture data 
is recorded separately from still picture data in the 
RTR_MOVVRO file. 

Still picture data VOB and audio data VOB 

[0188] As described above with reference to Fig. 11 , 
the time stamps for the still picture data VOBs and audio 
data VOBs are as shown below. 

SCR1 = 0 



SCR2 < 27000000 (27 MHz) - Tp 



SCR3 = 27000000 (27 MHz) 



Tp = 55296 (27 MHz) 



PTS1 = PTS3 = 90000 + Tv 



DTS1 =90000 



Description of a DVD recording apparatus 

[0189] A DVD recording apparatus is described next. 
[0190] Fig. 1 is a block diagram of a DVD recording 

s apparatus. Shown in Fig. 11 are: an optical pickup 11 
for reading data from and writing data to a disc; an error 
correction code (ECC) processor 12; a track buffer 13; 
a switch 14 for changing input to and output from the 
track buffer 13; an encoder 15; and a decoder 16. Ref- 

10 erence numeral 17 is an enlarged view of the disc sur- 
face. 

[0191] As shown in enlarged view 17, the smallest re- 
cording unit for data recorded to a DVD-RAM disc is the 
sector, which holds 2KB. One ECC block contains 16 
15 sectors, and is the unit processed by the ECC processor 
1 2 for error correction. 

[0192] Using a track buffer 13 enables AV data- re- 
corded at non-contiguous locations on the disc to be 
supplied to the decoder without an interruption in the 
20 data stream. This is described below with reference to 
Fig. 2. 

[01 93] Fig. 2 (a) shows the address space on the disc. 
When the AV data is recorded to two separate contigu- 
ous regions, [a1 , a2] and |a3, a4] as shown in Fig. 2 (a), 
25 continuous presentation of the AV data can be main- 
tained while seeking address a3 from a2 by supplying 
data accumulated to the track buffer to the decoder. This 
is illustrated in Fig. 2 (b). 

[0194] When reading AV data from address a1 starts 
30 at time t1 , the data is input to the track buffer with output 
from the track buffer beginning at the same time. There 
is, however, a difference of (Va - Vb) between the track 
buffer input rate Va and the output rate Vb from the track 
buffer. This means that data gradually accumulates in 
35 the track buffer at the rate (Va - Vb). This continues to 
address a2 at time t2. If B(t2) is the amount of data ac- 
cumulated in the track buffer at time t2, the data B(t2) 
stored in the track buffer can be supplied to the decoder 
until reading begins again from address a3 at time t3. 
40 [0195] More specifically, if the amount of data read 
from [a1 , a2] before the seek operation begins is at least 
equal to a predetermined amount, i.e., at least equal to 
the amount of data supplied to the decoder during the 
seek operation, AV data can be supplied without inter- 
ns ruption to the decoder. 

[0196] It should be noted that the still picture data sys- 
tem stream ST1 and audio data system stream ST2 
processed contiguously by the decoder in the present 
invention is not necessarily contiguously recorded to the 
so disc. In the case shown in Fig. 20, for example, there 
are two audio data system streams, streams #2 and #3, 
that can be processed continuously with the still picture 
data system stream ST1 #1 by the decoder. It will be 
obvious that only one of these audio data system 
55 streams can be recorded contiguously to the still picture 
data system stream ST1 on disc, and the other audio 
data system stream ST2 must be recorded at an ad- 
dress that is non-contiguous to stream #1. 
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[0197] A DVD recording apparatus comprised as de- 
scribed above, however, can still supply two non-con- 
tiguous streams to the decoder with no interruption be- 
tween the streams. The decoder can therefore continu- 
ously process two streams, and the operation described 
with reference to Fig. 19 can be assured. 
[0198] !t should be further noted that while the above 
example has addressed reading, that is, reproducing 
data from DVD-RAM, the same principle applies to writ- 
ing, that is, recording data to DVD-RAM. 
[0199] More specifically, insofar as a predetermined 
amount of data is recorded contiguously to DVD-RAM, 
continuous reproduction and recording are possible 
even if the AV data is recorded non-contiguously. 
[0200] Fig. 12 is a block diagram of a DVD recording 
apparatus. 

[0201 ] Shown in Fig. 1 2 are: a user interface 1 201 for 
presenting messages to the user and receiving com- 
mands from the user; a system controller 1 202 for over- 
all system control and management; an input section 

1203, typically a camera and microphone; an encoder 

1204, including a video encoder, audio encoder, and 
system stream encoder; an output section 1205, typi- 
cally comprising a monitor and speaker; a decoder 
1206, including a system stream decoder, audio decod- 
er, and video decoder; a track buffer 1207; and a drive 
1208. 

[0202] The recording operation of a DVD recording 
apparatus thus comprised is described next below with 
reference to the flow charts in Fig, 1 3, Fig. 14, and Fig. 
15. 

[0203] Operation starts when a user command is re- 
ceived by the user interface 1201. The user interface 
1 20 1 passes the user command to the system controller 
1202. The system controller 1202 interprets the user 
command, and appropriately instructs the various mod- 
ules to perform the required processes. Assuming that 
the user request is to capture a still picture and record 
the accompanying audio, the system controller 1202 in- 
structs the encoder 1 204 to encode one video frame and 
encode the audio. 

[0204] The encoder 1204 thus video encodes and 
then system encodes the one video frame sent from the 
input section 1203, thus generating a still picture data 
VOB. The encoder 1 204 then sends this still picture data 
VOBto the track buffer 1207, (S1301) 
[0205] This still picture data VOB encoding process is 
described more specifically below with reference to Fig. 
14. 

[0206] The encoder 1204 first initializes the various 
time stamps. In this example, it resets the system clock 
reference SCR to 0, and initializes the PTS and DTS to 
93003 (90 kHz) and 90000 (90 kHz), respectively. 
(S1401) Note that if PAL video is used, the PTS is ini- 
tialized to 93600 (90 kHz). 

[0207] If still picture data recording is not completed, 
the encoder 1 204 converts the still picture data to a pack 
and packet structure. (S1404) 



[0208] Once the pack and packet structure is gener- 
ated, the encoder 1204 calculates the SCR, DTS, and 
PTS time stamps, and inserts these values to the pack 
and packet stream of still picture data. (S1 405) Note that 
5 the SCR of the first pack is set to the initialization value 
of 0, and the PTS and DTS are set to the initialization 
values of 93003 (90 kHz) and 90000 (90 kHz), respec- 
tively. The SCR of the last pack in the stream is forced 
to a time stamp earlier than 27000000 (27 MHz) minus 
10 the pack transfer time Tp. 

[0209] The encoder 1204 then loops back to S1402, 
and determines whether still picture data recording has 
finished. If it has, the encoder 1204 notifies the system 
controller 1202 that still picture data VOB generation 
15 has been completed. The system controller 1202 then 
controls the drive 1208 to record the still picture data 
VOBs stored to the track buffer 1207 to the DVD-RAM 
disc. (S1403) 

[021 0] It will also be obvious to one with ordinary skill 
20 in the related art that while a DVD recording apparatus 
according to this preferred embodiment of the invention 
records to DVD-RAM disc after all still picture data VOBs 
have been generated, recording can proceed parallel to 
still picture data VOB generation to record the VOBs as 
25 they are generated. 

[0211] Returning to Fig. 1 3, after still picture data en- 
coding is completed, the encoder 1204 determines 
whether there is an audio recording to encode. If there 
is, it begins encoding the audio data sent from the input 
30 section 1 203, and sequentially transfers the generated 
audio data VOBs to the track buffer 1207. (S1302, 
S1303) 

[0212] This audio data encoding process is described 
more specifically below with reference to Fig. 15. 
35 [0213] The encoder 1204 first initializes the SCR and 
PTS time stamps. In this example, it sets the system 
clock reference SCR to 27000000 (27 MHz), and initial- 
izes the PTS to 93003 (90 kHz). Note that if the simul- 
taneously presented still picture is PAL video, the PTS 
40 is initialized to 93600 (90 kHz). (S1 501 ) 

[0214] If audio data recording is not completed, the 
encoder 1204 converts the audio data to a pack and 
packet structure (S1504), and calculates and insets the 
SCR and PTS time stamps (S1505). In this example, 
45 the SCR of the first pack is set to the initialization value 
of 27000000 (27 MHz), and the PTS is set to 93003 (90 
kHz). 

[0215] The encoder 1204 then loops back to S1502, 
and determines whether audio data recording has fin- 
50 ished. If it has, the encoder 1 204 notifies the system 
controller 1202. The system- controller 1202 then con- 
trols the drive 1208 to record the audio data VOBs 
stored in the track buffer 1207 to the DVD-RAM disc. 
(S1503) 

55 [0216] It will also be obvious to one with ordinary skill 
in the related art that while a DVD recording apparatus 
according to this preferred embodiment of the invention 
records to DVD-RAM disc after all audio data VOBs 
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have been generated, recording can proceed parallel to 
audio data VOB generation to record the VOBs as they 
are generated. 

[0217] The DVD recording apparatus continues re- 
cording still picture data and audio data to the DVD-RAM s 
disc using the above-described recording method until 
the user stops stream recording. 
[021 8] A stop recording command from the user is ap- 
plied to the system controller 1202 from the user inter- 
face 1 201 . The system controller 1 202 thus sends a stop 
recording command to the encoder 1 204, and controls 
the drive 1 208 to record the remaining VOBs in the track 
buffer 1207 to the DVD-RAM disc. 
[0219] After completing the above-described se- 
quence, the system controller 1202 generates a man- 
agement information file containing a VOB table and 
PGC table as shown in Fig. 9 (a), and drives the drive 
1208 to record the management information file to the 
DVD-RAM disc. (S1304) 

[0220] Decision diamond S1305 then determines 
whether audio data was recorded. If it was, the audio 
flag (Audio_Flag) is set to 1 in this example (S1306); if 
there was no audio data, the audio flag (Audio_Flag) is 
reset to 0 in this example (S1307). 
[0221 ] The management information is also set to ad- 
just the cell playback time (Cell_Playback_Time) for the 
still picture data and audio data to the audio playback 
time. 

[0222] The recording method according to the present 
invention as described above thus records to DVD-RAM 
disc still picture data and audio data in which the time 
stamps are assigned to predetermined values. 
[0223] The playback (reproducing) apparatus of the 
DVD recording apparatus is described next below with 
reference to Fig. 12 and the flow chart in Fig. 1 6. 
[0224] Operation starts when a user command is re- 
ceived by the user interface 1201. The user interface 

1 20 1 passes the user command to the system controller 
1202. The system controller 1202 interprets the user 
command, and appropriately instructs the various mod- 
ules to perform the required processes. Assuming that 
the user request is to play the disc, the system controller 

1 202 controls the drive 1 208 to read the PGC table con- 
taining the playback order from the management infor- 
mation file. 

[0225] The system controller 1202 then determines 
specific PGC information based on the PGC table read 
from disc. Following the playback order indicated by the 
PGC information, the system controller 1202 reproduc- 
es the corresponding VOBs. More specifically, the PGC 
information contains the cell playback order. Each cell 
contains a VOBJD and VOB start and end address in- 
formation. This cell information is what enables access- 
ing the still picture data VOBs. (S1601) 
[0226] The system controller 1202 then determines 
the state of the audio flag (Audio_Flag) in the still picture 
data cell to be reproduced. (S1602) 
[0227] If the audio flag (Audk>_Flag) is set (= 1), the 



system controller 1202 reads the extended audio VOB 
information, that is, the VOBJD and VOB start and end 
addresses, from the still picture data cell information to 
read both the still picture data VOB and the audio data 
VOB to be simultaneously reproduced. (S1603) 
[0228] As described above, the cell address informa- 
tion is referenced to the- VOB, and VOB address infor- 
mation is referenced to the AVfile. In practice, therefore, 
the VOB address information is added to the cell ad- 
dress information to calculate the address in the AV file 
that is used by the DVD reproducing apparatus to ac- 
cess and read AV data recorded to the DVD-RAM disc. 
(S1604) 

[0229] It should be noted that it the audio flag 
(Audio_Flag) is not set (i.e., is reset to 0), that is, only 
still picture data is to be reproduced with no audio, the 
still picture data is presented for the time indicated by 
the Cell_Playback_Time stored in the management in- 
formation file. 

[0230] The decoder process for continuously 
processing still picture data VOBs and audio data VOBs 
when the audio flag (Audio_Flag) is set (= 1) is de- 
scribed more specifically below. 
[0231] That is, the system controller 1202 first reads 
a still picture data VOB into the track buffer 1 207, and if 
the audio flag (Audio_Flag) is set, instructs the decoder 
1206 to decode the still picture data VOBs during the 
time needed to read the audio data VOB into the track 
buffer 1 207. The decoder 1 206 is instructed to begin de- 
coding as soon as audio data VOB reading starts. The 
decoder 1 206 thus reads MPEG streams stored to the 
track buffer 1207, and passes the decoded data to the 
output section 1205. The output section 1205 outputs 
data received from the decoder 1 206 to the monitor and 
speaker at the presentation time-specified in the data. 
[0232] By thus first reading and decoding still picture 
data as described above, image data and audio data 
can be reproduced synchronized to a specified presen- 
tation time once audio data reading begins. 
[0233] It is important to note here that the decoder 
1 206 is able to process a single still picture and accom- 
panying audio data as a single VOB by constructing still 
picture data VOBs and audio data VOBs as described 
above. 

[0234] It should also be noted that while the present 
invention has been described above with reference to a 
DVD-RAM disc, it can also be used with other types of 
media. The present invention shall therefore not be lim- 
ited to DVD-RAM discs and other types of optical discs. 
[0235] Furthermore, the present invention has been 
described using by way of example an audio stream as 
the stream to be simultaneously reproduced with the still 
picture data system stream ST1 . The invention shall not 
be so limited, however, and other types of information 
that can be output with a still picture data system stream 
ST1 can be alternatively used. For example, a second- 
ary image- system stream comprising bitmapped data 
or text data can also be used. A typical application for 
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such a second image system stream is to provide cap- 
tions or subtitles displayed superimposed on the photo- 
graphed still picture. 

[0236] Yet further, the present invention has been de- 
scribed using the cell as the unit for linking still picture 
data and audio data. Alternatively, one cell could be 
equal to one VOB, and the still picture data and audio 
data could be linked in VOB units. 
[0237] Yet further, the present invention has been de- 
scribed using same the cell playback time 
(Cell__Playback_Time) information in the- still picture da- 
ta and audio data. The cell playback time, however, 
need not necessarily be the same. For example, the au- 
dio data information could be given priority such that 
when the reproducing apparatus reads a different cell 
playback time (CelLPIayback_Time) it ignores the play- 
back information for the still picture data. 
[0238] Yet further, the present invention has been de- 
scribed with the still picture data VOBs and audio data 
VOBs recorded to an AV file separately from other 
VOBs, The present invention does not impose any limits 
on the AV file structure, however, and still picture data 
VOBs and audio data VOBs can be recorded with other 
VOBs in the same AV file. 

Advantages of the invention 

[0239] In an optical disc to which at least still picture 
data and audio data are recorded to separate recording 
areas as MPEG streams having a pack and packet 
structure, the time at which input of the last pack of still 
picture data to the decoder buffer starts (system clock 
reference SCR2), and the time at which input of the first 
pack of audio data to the decoder buffer starts (system 
clock reference SCR3), are recorded by means of the 
present invention to satisfy the equation 

SCR2 + Tp <; SCR3 

where Tp is the time required to transfer one pack to the 
decoder buffer. 

[0240] This makes it possible to decode separately re- 
corded still picture data and audio data system streams 
as though they are a single MPEG stream. 
[0241] In addition, by recording the time at which input 
of the first pack of still picture data to the decoder buffer 
starts (SCR1 ), the time at which input of the last pack of 
still picture data to the decoder buffer starts (SCR2), and 
the time at which input of the first pack of audio data to 
the decoder buffer starts (SCR3), to the following val- 
ues: 

SCR1 =0 



SCR2+Tp < 27000000 (27MHz) 



SCR3 = 27000000 (27MHz) 

still picture data and audio data encoded by different en- 
5 coders can still be decoded as though they are a single 
MPEG stream. 

[0242] Furthermore, by recording the [still picture] da- 
ta presentation start time (PTS1) and audio data pres- 
entation start time (PTS3) as the same values, still pic- 
10 ture data can be presented synchronized to the audio 
data, that is, presentation can begin simultaneously. 
[0243] In addition, by defining the still picture data 
presentation start time (PTS1) and audio data presen- 
tation start time (PTS3) as follows: 

15 

PTS1 = PTS3 = 90000 (90kHz) + Tv 

the decoder can synchronously reproduce even still pic- 

20 ture data and audio data encoded by different encoders. 
[0244] Yet further, by setting an identification flag 
(Audio_Flag) for declaring the presence of audio data 
to be synchronously reproduced in the management in- 
formation of still picture data, an optical disc reproducing 

25 apparatus can determine whether there is audio data to 
be reproduced, and still picture data and audio data can 
thus be synchronously reproduced. 
[0245] Although the present invention has been de- 
scribed in connection with the preferred embodiments 

30 thereof with reference to the accompanying drawings, it 
is to be noted that various changes and modifications 
will be apparent to those skilled in the art. Such changes 
and modifications are to be understood as included with- 
in the scope of the present invention as defined by the 

35 appended claims, unless they depart therefrom. 



Claims 

40 1 . An optical disc that is reproducible by a reproducing 
apparatus having a decoder buffer (53, 57), decod- 
er (54, 58), and output section (55, 56), said optical 
disc having recorded thereon: 

45 management information for managing one or 

more recording area on the disc as a file; 
a video part stream (ST1 ) comprising a plurality 
of units containing still picture data for one pic- 
ture, said units in the video part stream being 
50 recorded as a portion in said file, and 

an audio part stream (ST2) comprising one or 
a plurality of units containing audio data to be 
reproduced with the still picture data, said units 
in the audio part stream being recorded as a 
55 portion in said file; 

wherein said units store time stamp information 
indicative of a time required for a decoding 
process and the time at which the data is to be 



10 



15 



50 



18 



35 



EP0 949 825 B1 



36 



output, 

said time stamp information includes a time 
SCR2 indicative of a time at which the last unit 
in the video part stream is to be input to a de- 
coder buffer (53), and s 
a time SCR3 indicative of a time at which the 
first unit in the audio part stream is to be input 
to a decoder buffer (57), and 
said times SCR2 and SCR3 are defined to sat- 
isfy the following relation: 10 

SCR2 + Tp < SCR3 

where Tp is the time required from the start to 15 
the end of inputting one unit of the video part 
stream (ST1 ) to a decoder buffer, wherein said 
units in the video part stream are aligned in se- 
quence within said file, and wherein said units 
in the audio part stream are aligned in se- 20 
quence within said file. 

2. The optical disc as set forth in claim 1 , wherein the 
time stamp information further includes a time 
SCR1 indicative of a time at which the first unit in 25 
the video part stream is to be input to a decoder 
buffer, and the times SCR1 and SCR2 are defined 
as follow: 

30 

SCR1 =0 

SCR2 + Tp = 27000000/(27MHz) = 1 sec. 

35 

wherein (27 MHz) indicates the count of a 27 MHz 
clock. 



decoding start time DTS1 indicative of a time at 
which a decoder (53) starts decoding the video 
part stream; and 
time DTS1 is defined as: 

DTS1 =90000/(90 kHz) = 1 sec 

wherein (90 kHz) indicates the count of a 90 
kHz clock. 

6. The optical disc as set forth in claim 4, wherein 
times PTS1 and PTS3 are defined by the following 
equation: 

PTS1 = PTS3 = 90000/(90 kHz) + Tv 

where (90 kHz) indicates the count of a 90 kHz 
clock, and Tv is the video data frame period. 

7. The optical disc as set forth in claim 1 , wherein vid- 
eo and audio part stream management information 
(Volume information) is further recorded to the op- 
tical disc, and 

management information for the video part stream 
includes an identification flag (Audio_Flag) for de- 
claring there is audio data to be reproduced syn- 
chronized with the still picture data. 

8. An optical disc recording apparatus for recording a 
system stream containing still picture data and au- 
dio data to be reproduced with the still picture data 
to an optical disc that is reproducible by a reproduc- 
ing apparatus having a decoder buffer (53, 57), de- 
coder (54, 58), and output section (55, 56), said op- 
tical disc recording apparatus comprising: 



3. The optical disc as set forth in claim 1 , wherein time 
SCR3 is defined as: *o 

SCR3 = 27000000 / (27 MHz) = 1 sec. 

4. The optical disc as set forth in claim 1 , wherein the 45 
time stamp information further includes: 

time PTS1 indicative of a time at which the vid- 
eo part stream is output from the output section 
(55, 56); so 
time PTS3 indicative of a time at which the au- 
dio part stream is output from the decoder (58); 
and 

times PTS1 and PTS3 are the same. 

55 

5. The optical disc as set forth in claim 1 , wherein the 
time stamp information further includes: 



an encoder (1204), and 

a system controller (1 202); 

said encoder (1204) generating a video part 

stream (ST1) comprising a plurality of units 

containing still picture data for one picture, and 

an audio part stream (ST2) comprising one or 

a plurality of units containing audio data to be 

reproduced with the still picture data; 

said encoder (1204) storing in said units time 

stamp information indicative of a time required 

for a decoding process and the time at which 

the data is to be output; 

wherein the time stamp information includes a 

time SCR2 indicative of a time at which the last 

unit in the video part stream is to be input to a 

decoder buffer (53), and 

a time SCR3 indicative of a time at which the 

first unit in the audio part stream is to be input 

to a decoder buffer (57), and 

said times SCR2 and SCR3 are defined to sat- 
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isfy the following relation: 

SCR2 + Tp < SCR3 

where Tp is the time required from the start to 
the end of inputting one unit to a decoder buffer, 
wherein each unit recorded on said disc indi- 
cates a contiguous series of data on a track. 

9. The optical disc recording apparatus as set forth in 
claim 8, wherein the encoder further stores as time 
stamp information: 

a time SCR1 indicative of a time at which the 
first unit in the video part stream is input to a 
decoder buffer (53), and 
a time PTS1 indicative of a time at which the 
video part stream is output from the output sec- 
tion (55, 56), 

wherein times SCR1 , SCR2, and PTS1 are de- 
fined as follow: 

SCR1 =0 



SCR2 = 27000000/(27 MHz) - Tp 

PTS1 = 90000 / (90 kHz) + Tv 

where (27 MHz) indicates the count of a 27 MHz 
clock, 

(90 kHz) indicates the count of a 90 kHz clock, 
Tp is the time required to transfer the last unit 
of the video part stream, and 
Tv is the video data frame period. 

10. The optical disc recording apparatus as set forth in 
claim 9, wherein the encoder further stores as time 
stamp information: 

a time PTS3 indicative of a time at which the 
audio part stream is output from the decoder 
(58); and 

times SCR3 and PTS3 are defined as follow: 
SCR3 = 27000000 /(27 MHz) 

PTS3 = 90000 / (90 kHz) + Tv. 

11. The optical disc recording apparatus as set forth in 
claim 8, wherein the system controller generates 
video and audio part stream management informa- 
tion, and stores in the management information for 



the video part stream an identification flag 
(Audio_Flag) for declaring there is audio data to be 
reproduced synchronized with the still picture data. 

5 12. The optical disc recording apparatus as set forth in 
claim 8, wherein the system controller records au- 
dio data reproduction time (Cell_Playback_Time) in 
the management information for the audio part 
stream. 

10 

13. An optical disc recording method for recording a 
system stream containing still picture data and au- 
dio data to be reproduced with the still picture data 
to an optical disc that is reproducible by a reproduc- 
is ing apparatus having a decoder buffer (53, 57), de- 
coder (54, 58), and output section (55, 56), said op- 
tical disc recording method comprising: 

a management information recording step for 
20 recording management information for manag- 

ing one or more recording area on the disc as 
a file; 

a video part stream (ST1 ) recording step for re- 
cording a video part stream (ST1) comprising 

25 a plurality of units containing still picture data 

for one picture, said units in the video part 
stream being recorded as a portion in said file; 
an audio part stream (ST2) recording step for 
recording an audio part stream (ST2) compris- 

30 jng one or a plurality of units containing audio 

data to be reproduced with the still picture data, 
said units in the audio part stream being record- 
ed as a portion in said file; and 
a time stamp information recording step for re- 

35 cording time stamp information indicative of a 

time required for a decoding process and out- 
put to said units; 

wherein the time stamp information includes a 
time SCR2 indicative of a time at which the last 

40 unit in the video part stream is to be input to a 

decoder buffer (53), and 
a time SCR3 indicative of a time at which the 
first unit in the audio part stream is to be input 
to a decoder buffer (57), and 

45 said times SCR2 and SCR3 are defined to sat- 

isfy the following equation: 

SCR2 + Tp = SCR3 

so 

where Tp is the time required from the start to 
the end of inputting one unit to a decoder buffer, 
wherein said units in the video part stream are 
aligned in sequence within said file, and where- 
55 in said units in the audio part stream are aligned 

in sequence within said file. 

14. The optical disc recording method as set forth in 
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claim 1 3, wherein the time stamp information further 
includes: 

a time SCR1 indicative of a time at which the 
first unit in the video part stream is input to a s 
decoder buffer (53), and 
a time PTS1 indicative of a time at which the 
video part stream is output from the output sec- 
tion (55, 56), 

wherein times SCR1 , SCR2, and PTS1 are de- 10 
fined as follow: 

SCR1 = 0 

15 

SCR2 = 27000000 /(27 MHz) - Tp 



PTS1 = 90000 / (90 kHz) + Tv 2Q 

where (27 MHz) indicates the count of a 27 MHz 
clock, 

(90 kHz) indicates the count of a 90 kHz clock, 
Tp is the time required to transfer the last unit 25 
of the video part stream, and 
Tv is the video data frame period. 

15. The optical disc recording method as set forth in 
claim 1 4, wherein the time stamp information further 30 
includes: 

a time PTS3 indicative of a time at which the 
audio part stream is output from the decoder 
(58); and 35 
times SCR3 and PTS3 are defined as follow: 

SCR3 = 27000000/(27 MHz) 

40 

PTS3 = 90000 /(90 kHz) + Tv. 

16. The optical disc recording method as set forth in 
claim 1 3, further comprising: 45 

a management information recording step for 
recording management information for the video 
and audio part streams, and generates in the man- 
agement information for the video part stream an 
identification flag (Audio_Flag) for declaring there so 
is audio data to be reproduced synchronized with 
the still picture data. 

17. The optical disc recording method as set forth in 
claim 16, wherein an audio data reproduction time 55 
(Cell_Playback_Time) is further stored in the man- 
agement information for the audio part stream. 



Patentanspruche 

1 . Optische Disk, die wiedergebbar ist durch eine Wie- 
dergabevorrichtung, mit einem Dekodiererpuffer 
(53, 57), einem Dekodierer (54, 58) und einer Aus- 
gabesektion (55, 56), wobei auf der optischen Disk 
aufgezeichnet ist: 

Verwaltungsinformationen zum Verwalten von 
einem oder mehreren Aufzeichnungsberei- 
chen auf der Disk als eine Datei; 
ein vldeo-Teilstrom (ST1) mit einer Mehrzahl 
von Einheiten mit Stand bilddaten fOr ein Bild, 
wobei die Einheiten in dem vldeo-Teilstrom als 
Abschnitt in der Datei aufgezeichnet sind, und 
ein Audio-Teilstrom (ST2) mit einer oder meh- 
reren Einheiten mit Audiodaten, die mit den 
Stand bilddaten wiederzugeben sind, wobei die 
Einheiten in dem Audio-Teilstrom als Abschnitt 
der Datei aufgezeichnet sind; 
wobei diese Einheiten Zeitmarkeninformatio- 
nen speichern, welche eine fur einen Dekodie- 
rungsvorgang erforderliche Zeit und die Zeit 
angeben, zu welcher die Daten auszugeben 
sind, 

wobei die Zeitmarken information eine Zeit 
SCR2 beinhaltet, welche eine Zeit angibt, zu 
welcher die letzte Einheit in den Video-Teilst- 
rom in einen Dekodiererpuffer (53) einzugeben 
ist, und 

eine Zeit SCR3, welche eine Zeit angibt, zu wel- 
cher die erste Einheit in dem Audio-Teilstrom in 
einen Dekodiererpuffer (57) einzugeben ist, 
und 

wobei die Zeiten SCR2 und SCR3 bestimmt 
sind, urn die folgende Beziehung zu erfullen: 

SCR2 + Tp £ SCR3, 

wobei Tp die vom Anf ang bis zum Ende der Ein- 
gabe einer Einheit des Video-Teilstroms (ST1) 
in einen Dekodiererpuffer erforderliche Zeit ist, 
wobei die Einheiten in dem vldeo-Teilstrom in 
einer Folge innerhalb der Datei ausgerichtet 
sind, und wobei die Einheiten in dem Audio- 
Teilstrom in der Folge innerhalb der Datei aus- 
gerichtet sind. 

2. Optische Disk nach Anspruch 1 , 

bei welcher die Zeitmarken information weiterhin ei- 
ne Zeit SCR1 beinhaltet, welche eine Zeit anzeigt, 
zu welcher die erste Einheit in dem vldeo-Teilstrom 
in einen Dekodiererpuffer einzugeben ist, und die 
Zeiten SCR1 und SCR2 sind wie folgt definiert: 

SCR1 = 0 
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SCR2 + Tp = 27000000/(27 MHz) = 1 s. 

wobei (27 MHz) die Zahlung eines 27 MHz-Taktes 
angibt. 

3. Optische Disk nach Anspruch 1 , 

bei welcher die Zeit SCR3 definiert ist als: 

SCR3 = 27000000/(27 MHz) = 1 s. 

4. Optische Disk nach Anspruch 1 , 

bei welcher die Zeitmarken information weiter- 
hin beinhaltet: 

eine Zeit PTS1 , welche eine Zeit angibt, zu wel- 
cher der Video-Teilstrom von der Ausgabesek- 
tion (55, 56) ausgegeben wird; 
eine Zeit PTS3, welche eine Zeit angibt, zu wel- 
cher der Audio-Teilstrom von dem Dekodierer 
(58) ausgegeben wird; 

die Zeiten PTS1 und PTS3 sind die gleichen. 

5. Optische Disk nach Anspruch 1 , 

bei welcher die Zeitmarken information weiter- 
hin beinhaltet: 

eine Dekodierungsanfangszeit DTS1, welche 
eine Zeit angibt, zu welcher ein Dekodierer (53) 
das Dekodieren des Video-Teilstroms beginnt; 
und 

die Zeit DTS1 ist definiert als: 

DTS1 = 90000 / (90 kHz) = 1 s. 

wobei (90 kHz) die Zahlung eines 90 kHz-Tak- 
tes bezeichnet. 

6. Optische Disk nach Anspruch 4, 

bei welcher die Zeiten PTS1 und PTS3 bestimmt 
sind durch die folgende Gleichung: 

PTS1 = PTS3 = 90000 / (90 kHz) + Tv 

wobei (90 kHz) die Zahlung eines 90 kHz-Taktes 
bezeichnet und Tv die Videodaten-Vollbild-Periode 

ist. 

7. Optische Disk nach Anspruch 1 , 

bei welcher die Video- und Audio-Teilstrom- 
Verwaltungsinformatbn (Lautstarkeinformati- 
on) weiterhin auf der optischen Disk aufge- 
zeichnet ist, und 

die Verwaltungsinformation fur den Video- 



Teilstrom ein Identifizierungs-Flag 
(Audio_Flag) beinhaltet, zum Angeben, dass 
Audiodaten synch ron is iert mit den Standbild- 
daten wiederzugeben sind. 

5 

8. Aufzeichnungsvorrichtung fur eine optische Disk 
zum Aufzeichnen eines Systemstromes mit Stand- 
bilddaten und Audiodaten, die mit den Standbildda- 
ten wiederzugeben sind, auf einer optischen Disk, 
10 die wiedergebbar ist durch eine Wiedergabevor- 
richtung mit einem Dekodiererpuffer (53, 57), einem 
Dekodierer (54, 58), und einer Ausgabesektion (55, 
56), wobei die Aufzeichnungsvorrichtung fur die op- 
tische Disk umfasst: 

75 

einen Kodierer (1204), und 
eine Systemsteuerung (1202); 
wobei der Kodierer (1204) einen Video-Teilst- 
rom (ST1 ) erzeugt, mit einer Mehrzahl von Ein- 

20 heiten mit Standbilddaten fur ein Bild, und 

einen Audio-Teilstrom (ST2) mit einer Oder ei- 
ner Mehrzahl von Einheiten mit Audiodaten, die 
mit den Standbilddaten wiederzugeben sind; 
wobei der Kodierer (1204) in den Einheiten 

25 Zeitmarken informationen speichert, welche ei- 

ne Zeit angeben, die fur einen Dekodierungs- 
vorgang erforderlich ist, und die Zeit, zu wel- 
cher die Daten auszugeben sind; 
wobei die Zeitmarkeninformation eine Zeit 

30 SCR2 beinhaltet, welche eine Zeit angibt, zu 

welcher die letzte Einheit in dem Video-Teilst- 
rom in einen Dekodiererpuffer (53) einzugeben 
ist, und 

eine Zeit SCR3, welche eine Zeit angibt, zu wel- 
35 cher die erste Einheit in dem Audio-Teilstrom in 

einen Dekodiererpuffer (57) einzugeben ist, 
und 

wobei die Zeiten SCR2 und SCR3 definiert 
sind, urn die folgende Beziehung zu erfullen: 

40 

SCR2 + Tp < SCR3 

wobei Tp die vom Beginn bis zum Ende der Ein- 
45 gabe einer Einheit in einen Dekodiererpuffer er- 

forderliche Zeit ist, wobei jede auf der Disk auf- 
gezeichnete Einheit eine fortlaufende Reihe 
von Daten auf einer Spur angibt. 

so 9. Aufzeichnungsvorrichtung fur eine optische Disk 
nach Anspruch 8, bei welcher der Dekodierer wei- 
terhin als Zeitmarkeninformation speichert: 

eine Zeit SCR1 , welche eine Zeit angibt, zu wel- 
55 cher die erste Einheit in dem Video-Teilstrom in 

einen Dekodiererpuffer (53) eingegeben wird, 
und 

eine Zeit PTS1 , welche eine Zeit angibt, zu wel- 
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cher der Video-Teilstrom von dem Ausgabeab- 
schnitt (55, 56) ausgegeben wird, 
wobei die Zeiten SCR1, SCR2 und PTS1 wie 
folgt definiert sind: 

SCR1 = 0 



SCR2 = 27000000 / (27 MHz) - Tp 



PTS1 = 90000 / (90 kHz) + Tv, 

wobei (27 MHz) die Zahlung eines 27 MHz-Tak- 
tes angibt, 

(90 kHz) die Zahlung eines 90 kHz-Taktes an- 
gibt, 

Tp die zum Ubertragen der letzten Einheit des 
Video-Teilstroms erforderliche Zeit ist, und 
Tv die Videodaten-Vollbild-Periode ist. 

10. Aufzeichnungsvorrichtung fur eine optische Disk 
nach Anspruch 9, bei welcher der Kodierer weiter- 
hin als Zeitmarkeninformation speichert: 

eine Zeit PTS3, welche eine Zeit angibt, zu wel- 
cher der Audio-Teilstrom von dem Dekodierer 
(58) ausgegeben wird; und 
die Zeiten SCR3 und PTS3 sind wie folgt defi- 
niert: 

SCR3 = 27000000 / (27 MHz) 



PTS3 = 90000 / (90 kHz) + Tv. 

11. Aufzeichnungsvorrichtung fur eine optische Disk 
nach Anspruch 8, bei welcher die Systemsteuerung 
Video- und Audio-Teilstrom-Verwaltungsinformatio- 
nen erzeugt, und in den Verwaltungsinformationen 
fur den Video-Teilstrom ein Identifizierungs-Flag 
(Audio_Flag) speichert, zum Angeben, dass Au- 
diodaten synchronisiert mit den Standbilddaten 
wiederzugeben sind. 

12. Aufzeichnungsvorrichtung fur eine optische Disk 
nach Anspruch 8, bei welcher die Systemsteuerung 
die Audiodaten-Wiedergabezeit 
(Cell_Playback_Time) in der Verwaltungsinformati- 
on fur den Audio-Teilstrom aufzeichnet. 

13. Aufzeichnungsvertahren fur eine optische Disk zum 
Aufzeichnen eines Systemstromes mit Standbild- 
daten und Audiodaten, die mit den Standbilddaten 
wiederzugeben sind, auf einer optischen Disk, die 
wiedergebbar ist durch eine Wiedergabevorrich- 



tung mit einem Dekodiererpuffer (53, 57), einem 
Dekodierer (54, 58) und einer Ausgabesektion (55, 
56), wobei das Aufzeichnungsvertahren fur die op- 
tische Disk umfasst: 

einen Verwattungsinformations-Aufzeich- 
nungsschritt zum Aufzeichnen von Verwal- 
tungsinformationen zum Verwalten von einem 
Oder mehreren Aufzeichnungsbereichen auf 
der Disk als eine Datei; 
einen Video-Teilstrom (STI)-Aufzeichnungs- 
schritt zum Aufzeichnen eines Video-Teilstro- 
mes (ST1 ) mit einer Mehrzahl von Einheiten mit 
Standbilddaten fur ein Bild, wobei Einheiten in 
dem Video-Teilstrom als ein Abschnitt in der 
Datei aufgezeichnet werden; 
einen Audio-Teilstrom (ST2)-Aufzeichnungs- 
schritt zum Aufzeichnen eines Audio-Teilstro- 
mes (ST2) mit einer Oder mehreren Einheiten 
mit Audiodaten, die mit den Standbilddaten 
wiederzugeben sind, wobei die Einheiten in 
dem Audio-Teilstrom als Abschnitt in der Datei 
aufgezeichnet sind; und 
einem Zeitmarkeninformations-Aufzeich- 
nungsschritt zum Aufzeichnen von Zeitmarken- 
informationen, welche eine fur einen Dekodie- 
rungsvorgang und zum Ausgeben der Einhei- 
ten erforderliche Zeit angeben; 
wobei die Zeitmarkeninformation eine Zeit 
SRC2 beinhaltet, welche eine Zeit angibt, zu 
welcher die letzte Einheit in dem Video-Teilst- 
rom in einen Dekodiererpuffer (53) einzugeben 
ist, und 

eine Zeit SCR3, welche eine Zeit angibt, zu wel- 
cher die erste Einheit in dem Audio-Teilstrom in 
einen Dekodiererpuffer (57) einzugeben ist, 
und 

wobei die Zeiten SCR2 und SCR3 bestimmt 
sind, urn die folgende Gleichung zu erfullen: 

SCR2 + Tp = SCR3 

wobei Tp die vom Anfang zum Ende der Einga- 
be einer Einheit in einen Dekodiererpuffer er- 
forderliche Zeit ist, wobei die Videoeinheiten in 
dem Video-Teilstrom in einer Folge innerhalb 
der Datei ausgerichtet sind, und wobei die Ein- 
heiten in dem Audio-Teilstrom in einer Folge in 
der Datei ausgerichtet sind. 

14. Aufzeichnungsvertahren fur eine optische Disk 
nach Anspruch 13, bei welchem die Zeitmarkenin- 
formation weiterhin beinhaltet: 

eine Zeit SCR1 , die eine Zeit angibt, zu welcher 
die erste Einheit in dem Video-Teilstrom in ei- 
nen Dekodiererpuffer (53) eingegeben wird, 
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und 

eine Zeit PTS1 , die eine Zeit angibt, zu welcher 
der Video-Teilstrom von der Ausgabesektion 
(55, 56) ausgegeben wird, 
wobei die Zeiten SCR1, SCR2 und PTS1 wie 5 
folgt definiert sind: 

SCR1 = 0 

10 

SCR2 = 27000000 / (27 MHz) - Tp 

PTS1 = 90000 / (90 kHz) + Tv, 15 

wobei (27 MHz) die Zahlung eines 27 MHz-Tak- 
tes angibt, 

(90 kHz) gibt die Zahlung eines 90 kHz-Taktes 
an, 20 
Tp ist die zum Ubertragen der letzten Einheit 
des Video-Teilstromes erforderliche Zeit, und 
Tv ist die Videodaten-Vollbild-Periode. 

15. Aufzeichnungsverfahren fur eine optische Disk 25 
nach Anspruch 14, bei welchem die Zeitmarken in- 
formation weiterhin beinhaltet: 

eine Zeit PTS3, welche eine Zeit angibt, zu wel- 
cher der Audio-Teilstrom von dem Dekodierer 30 
(58) ausgegeben wird; und 
Zeiten SCR3 und PTS3, welche wie folgt defi- 
niert sind: 

SCR3 = 27000000 / (27 MHz) 
PTS3 = 90000 / (90 kHz) + Tv. 

40 

16. Aufzeichnungsverfahren fur eine optische Disk 
nach Anspruch 1 3, mit: einem Verwaltungsinforma- 
tions-Aufzeichnungsschritt zum Autzeichnen von 
Verwaltungsinformationen fur die Video- und Audio- 
Teilstrome, und erzeugt in der Verwaltungsinforma- 45 
tion fur den Video-Teilstrom ein identifizierungs- 
Flag (Audio_Flag) zum Angeben, dass dort Au- 
diodaten synchronisiert mit den Standbilddaten 
wiederzugeben sind. 

so 

17. Aufzeichnungsverfahren fur eine optische Disk 
nach Anspruch 16, bei welchem eine Audiodaten- 
Wiedergabezeit (Cell_Playback_Time) weiterhin in 
der Verwaltungsinformation fur den Audio-Teilstrom 
gespeichert ist. ss 



Revendications 

1 . Disque optique qui peut etre reproduit par un appa- 
reil de reproduction comportant une rrtemoire tam- 
pon de ddcodeur (53, 57), un dScodeur (54, 58) et 
une section de sortie (55, 56), ledit disque optique 
comportant, enregisttes sur celui-ci : 

des informations de gestion destinees a gerer 
une ou plusieurs zones d'enregistrement sur le 
disque sous forme d'un fichier, 
un flux de partie video (ST1) comprenant une 
plurality d'unites contenant des donndes d'ima- 
ge fixes pour une image, lesdites unites du flux 
de la partie video etant enregistrSes sous forme 
d'une partie dudit fichier, et 
un flux de partie audio (ST2) comprenant une 
unite" ou une plurality d'unites contenant des 
donnees audio devant etre reproduites avec les 
donnSes d'image fixes, lesdites unites du flux 
de la partie audio etant enregistrees en tant que 
partie dudit fichier, 

dans lequel lesdites unites ntemorisent des in- 
formations dement horodateur indicatives 
d'un temps requis pour un processus de d6co- 
dage et du temps auquel les donnees doivent 
6tre fournies en sortie, 

lesdites informations d'element horodateur 
comprennent un temps SCR2 indicatif du 
temps auquel laderniere unite du flux de ia par- 
tie video doit etre fournie en entree a une me- 
moire tampon du decode ur (53), et 
un temps SCR3 indicatif d'un temps auquel la 
premiere unite du flux de la partie audio doit 
etre fournie en entree a une memoire tampon 
du dScodeur (57), et 

lesdits temps SCR2 et SCR3 sont definis de fa- 
con a satisfaire la relation suivante : 

SCR2 + Tp < SCR3 

ou Tp est le temps necessaire du debut a la fin 
de I'entree d'une unite du flux de la partie video 
(ST1 ) dans une memoire tampon de dScodeur, 
dans lequel lesdites unites du flux de la partie 
video sont align6es en sequence a I'interieur 
dudit fichier, et dans lequel lesdites unites du 
flux de la partie audio sont alignees en sequen- 
ce a rinterieur dudit fichier. 

2. Disque optique selon la revendication 1, dans le- 
quel les informations d'element horodateur com- 
prennent en outre un temps SCR1 indicatif d'un 
temps auquel la premiere unite du flux de la partie 
video doit etre fournie en entree a une memoire 
tampon de decodeur, et les temps SCR1 et SCR2 
sont definis comme suit : 
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SCR1 =0 



SCR2 + Tp = 27 000 000/(27 MHz) = 1 seconde, 

dans lequel (27 MHz) indique le comptage 
d'une horloge a 27 MHz. 

3. Disque optique selon la revendication 1 , dans le- 
quel le temps SCR3 est delini par : 

SCR3 = 27 000 000/(27 MHz) = 1 seconde. 

4. Disque optique selon la revendication 1 , dans le- 
quel les informations de ltel§ment horodateur com- 
prennent en outre : 

un temps PTS1 indicatif d'un temps auquel le 
flux de la partie video est fourni en sortie depuis 
la section de sortie (55, 56), 
un temps PTS3 indicatif d'un temps auquel le 
flux de la partie audio est fourni en sortie a partir 
du decodeur (58), et 

les temps PTS1 et PTS3 sont les memes. 

5. Disque optique selon la revendication 1, dans le- 
quel les informations de Itetement horodateur com- 
prennent en outre : 

un temps de debut de decodage DTS1 indicatif 
d'un temps auquel un decodeur (53) debute le 
decodage du flux de la partie video, et 
le temps DTS1 est d6fini par : 

DTS1 = 90 000/(90 kHz) = 1 seconde 

dans lequel (90 kHz) indique le comptage 
d'une horloge a 90 kHz. 

6. Disque optique selon la revendication 4, dans le- 
quel les temps PTS1 et PTS3 sont definis par 
liquation suivante : 

PTS1 = PTS3 = 90 000/(90 kHz) + Tv 

ou (90 kHz) indique le comptage d'une horlo- 
ge a 90 kHz et Tv represente la periode d'une trame 
de donnees video. 

7. Disque optique selon la revendication 1, dans le- 
quel les informations de gestion du flux des parties 
video et audio (informations de volume) sont en 
outre enregistrSes sur le disque optique, et 

les informations de gestion pour le flux de la 



partie video comprennent un indicateur d'identifica- 
tion (indicateur audio) destine a declarer s'il y a des 
donnees audio a reproduire de facon synchronised 
avec les donndes d'image fixes. 

5 

8. Appareil d'enregistrement de disque optique desti- 
ne a enregistrer un flux de systeme contenant des 
donnSes d'image fixes et des donnees audio devant 
etre reproduces avec les donndes d'image fixes sur 

10 un disque optique qui peut §tre reproduit par un ap- 
pareil de reproduction comportant une memoire 
tampon de decodeur (53, 57), un decodeur (54, 58), 
et une section de sortie (55, 56), ledit appareil d'en- 
registrement de disque optique comprenant : 

15 

un codeur (1024), et 
un controleur de systeme (1 202), 
ledit codeur (1204) g6n6rant un flux de partie 
video (ST1) comprenant une plurality d'unites 
20 contenant des donnees d'image fixes pour une 

image, et 

un flux des parties audio (ST2) comprenant une 
unite ou une plurality d'unites contenant des 
donnees audio devant Stre reproduites avec les 

25 donnees d'image fixes, 

ledit codeur (1204) rrtemorisant dans lesdites 
unites des informations d'6l6ment horodateur 
indicatives d'un temps necessaire a un proces- 
sus de decodage et du temps auquel les don- 

30 nees doivent §tre fournies en sortie, 

dans lequel les informations d'element horoda- 
teur comprennent un temps SCR2 indicatif d'un 
temps auquel la dernidre unite du flux de la par- 
tie video doit §tre appliqu6e en entree a une 

35 memoire tampon de decodeur (53), et 

un temps SCR3 indicatif d'un temps auquel la 
premiere unite du flux de la partie audio doit 
etre appliqu6e en entree a une memoire tam- 
pon de decodeur (57), et 

40 lesdits temps SCR2 et SCR3 sont definis de fa- 

con a satisfaire la relation suivante : 

SCR2 + Tp < SCR3 

45 

ou Tp est le temps necessaire du debut jusqu'a 
la fin de I'entr6e d'une unite dans une nrtemoire 
tampon de decodeur, dans lequel chaque unite 
enregistree sur ledit disque indique une serie 
so contigue de donnees sur une piste. 

9. Appareil d'enregistrement de disque optique selon 
la revendication 8, dans lequel le codeur memorise 
en outre en tant qu'informations d'6lement 

55 horodateur : 

un temps SCR1 indicatif d'un temps auquel la 
premiere unite dans le flux de la partie video 
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est appliquee en entree a une mSmoire tampon 
de decodeur (53), et 

un temps PTS1 indicatif d'un temps auquel le 
flux de la partie video est tourni en sortie depuis 
la section de sortie (55, 56), 
dans lequel les temps SCR1 , SCR2 et PTS1 
sont definis comme suit : 

SCR1 = 0, 



SCR2 = 27 000 000/(27 MHz) -Tp 



PTS1 = 90 000/(90 kHz) + Tv 

ou 27 MHz indique le comptage d'une horloge 
a 27 MHz, 

90 kHz indique le comptage d'une horloge k 90 
kHz, 

Tp represente le temps necessaire pour trans- 
ferer la derni&re unite du flux de la partie video, 
et Tv represente la periode d'une trame de don- 
nees video. 

10. Appareil d'enregistrement de disque optique selon 
la revendication 9, dans lequel le codeur memorise 
en outre en tant qu'informations d'eiement 
horodateur : 

un temps PTS3 indicatif d'un temps auquel le 
flux de la partie audio est fourni en sortie depuis 
le decodeur (58), et 

les temps SCR3 et PTS3 sont definis comme 
suit : 

SCR3 = 27 000 000/(27 MHz) 



PTS3 = 90 000/(90 kHz) + Tv. 

11. Appareil d'enregistrement de disque optique selon 
la revendication 8, dans lequel le controleur du sys- 
teme g6n6re des informations de gestion de flux 
des parties video et audio, et memorise -dans les 
informations de gestion pour le flux de la partie vi- 
deo un indicateur d'identification (indicateur audio) 
afin de declarer s'il y a des donnees audio a repro- 
duce de facon synchronisee avec les donnees 
d' image fixes. 

12. Appareil d'enregistrement de disque optique selon 
la revendication 8, dans lequel le controleur du sys- 
tfeme enregistre un temps de reproduction de don- 
nees audio (temps de restitution de cellule) dans 
les informations de gestion pour le flux de la partie 



audio. 

1 3. Proc6de d'enregistrement de disque optique desti- 
ne k enregistrer un flux de systeme contenant les- 

5 dites donn6es d'image fixes et donn6es audio de- 
vant etre reproduites avec les donnees d'image 
fixes sur un disque optique qui peut etre reproduit 
grSce a un appareil de reproduction comportant une 
mdmoire tampon de decodeur (53, 57), un deco- 

10 deur (54, 58), et une section de sortie (55, 56), ledit 
precede d'enregistrement de disque optique 
comprenant : 

une etape d'enregistrement d'informations de 
is gestion destinee k enregistrer des informations 

de gestion en vue de gerer une ou plusieurs 
zones d'enregistrement sur le disque sous for- 
me d'un fichier, 

une etape d'enregistrement de flux de partie vi- 
20 deo (ST1 ) destin6e a enregistrer un flux de par- 

tie video (ST1) comprenant une plurality d'uni- 
tes contenant les donnees d'image fixes pour 
une image, lesdites unites du flux de la partie 
video etant enregistrees sous forme d'une par- 
25 tie dud it fichier, 

une etape d'enregistrement de flux de partie 
audio (ST2) destinee a enregistrer un flux de 
partie audio (ST2) comprenant une unite ou 
une plurality d'unites contenant des donnees 
30 audio devant etre reproduites avec les donnees 

d'image fixes, lesdites unites du flux de la partie 
audio etant enregistr§es sous forme d'une par- 
tie dudit fichier, et 

une etape d'enregistrement d'informations 
35 d'6l6ment horodateur destinee k enregistrer 

des informations d'element horodateur indica- 
tives d'un temps n6cessaire k un processus de 
decodage et fournies en sortie auxdites unites, 
dans lequel les informations d'dtement horoda- 
40 teur comprennent un temps SCR2 indicatif d'un 

temps auquel la derni6re unite du flux de la par- 
tie video doit etre fournie en entree k une me* 
moire tampon de decodeur (53), et 
un temps SCR3 indicatif d'un temps auquel la 
45 premiere unite du flux de la partie audio doit 

etre fournie en entree k une memoire tampon 
de decodeur (57), et 

lesdits temps SCR2 et SCR3 sont definis de fa- 
con k satisfaire liquation suivante : 

so 

SCR2 + Tp = SCR3 

ou Tp est le temps necessaire entre le debut et 
55 la fin de I'entree d'une unite dans une m6moire 

tampon de decodeur, dans lequel lesdites uni- 
tes du flux de la partie video sont alignees en 
sequence a I'int6rieur dudit fichier, et dans le- 
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que! lesdites unites du fiux de la partie audio 
sont alignees en sequence a I'interieur dudit fi- 
chier. 

14. Precede d'enregistrement de disque optique selon 
la revendication 13, dans lequel les informations 
d'element horodateur comprennent en outre : 

un temps SCR1 indicatif d'un temps auquel la 
premiere unite du flux de la partie video est 
fournie en entree a une memoire tampon de de- 
codeur (53), et 

un temps PTS1 indicatif d'un temps auquel le 
flux de la partie video est fourni en sortie depuis 
la section de sortie (55, 56), 
dans lequel les temps SCR1 , SCR2 et PTS1 
sont definis comme suit : 

SCR1 = 0 



SCR2 = 27 000 000/(27 MHz) - Tp 



PTS1 = 90 000/(90 kHz) + Tv 

ou (27 MHz) indique le comptage d'une horloge 
a 27 MHz, 

(90 kHz) indique le comptage d'une horloge a 
90 kHz, 

Tp est le temps necessaire pour transferer la 
derniere unite du flux de la partie video, et 
Tv represente la periode d'une trame de don- 
nees video. 

15. Procede d'enregistrement de disque optique selon 
la revendication 14, dans lequel les informations 
d'element horodateur comprennent en outre : 

un temps PTS3 indicatif d'un temps auquel le 
flux de la partie video est fourni en sortie depuis 
le decodeur (58), et 

les temps SCR3 et PTS3 sont definis comme 
suit : 

SCR3 = 27 000 000/ (27 MHz) 



PTS3 = 90 000/(90 kHz) + Tv. 

16. Procede d'enregistrement de disque optique selon 
la revendication 13, comprenant en outre : 

une etape d'enregistrement d'informations de 
gestion destinee a enregistrer les informations de 
gestion pour le flux des parties video et audio, et 
qui genere dans les informations de gestion pour le 



flux de la partie video un indicateur d'identification 
(indicateur audio) destine a declarer s'il existe des 
donnees audio a reproduire de facon synch ran isee 
avec les donnees d'image fixes. 

s 

17. Procede d'enregistrement de disque optique selon 
la revendication 16, dans lequel un temps de repro- 
duction de donnees audio (temps de restitution de 
cellule) est en outre memorise dans les informa- 
10 tions de gestion pour le flux de la partie audio. 
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Fig.8(a) 
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Fig. 10(a) 
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Fig. 14 
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Fig. 15 
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Fig.16 
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